Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm

Alexandra June Gordon; Imon Banerjee; Jason Block; Christopher Winstead-Derlega; Jennifer G. Wilson; Tsuyoshi Mitarai; Michael Jarrett; Josh Sanyal; Daniel L. Rubin; Max Wintermark; Michael A. Kohn

doi:10.1016/j.ajem.2021.11.001

Diagnostic Radiology

Research output: Contribution to journal › Article › peer-review

Abstract

Background: The Mortality Probability Model (MPM) is used in research and quality improvement to adjust for severity of illness and can also inform triage decisions. However, a limitation for its automated use or application is that it includes the variable “intracranial mass effect” (IME), which requires human engagement with the electronic health record (EHR). We developed and tested a natural language processing (NLP) algorithm to identify IME from CT head reports.

Methods: We obtained initial CT head reports from adult patients who were admitted to the ICU from our ED between 10/2013 and 9/2016. Each head CT report was labeled yes/no for IME by at least two of five independent labelers. The reports were then randomly divided 80/20 into training and test sets. All reports were preprocessed to remove linguistic and style variability, and a dictionary was created to map similar common terms. We tested three vectorization strategies to convert the report text to a numerical vector: Term Frequency-Inverse Document Frequency (TF-IDF), Word2Vec, and Universal Sentence Encoder. This vector served as the input to a classification-tree-based ensemble machine learning algorithm (XGBoost). After training, model performance was assessed in the test set using the area under the receiver operating characteristic curve (AUROC). We also divided the continuous range of scores into positive/inconclusive/negative categories for IME.

Results: Of the 1202 CT reports in the training set, 308 (25.6%) reports were manually labeled as “yes” for IME. Of the 355 reports in the test set, 108 (30.4%) were labeled as “yes” for IME. The TF-IDF vectorization strategy as an input for the XGBoost model had the best AUROC: 0.9625 (95% CI 0.9443–0.9807). TF-IDF score categories were defined and had the following likelihood ratios: “positive” (TF-IDF score > 0.5) LR = 24.59; “inconclusive” (TF-IDF 0.05–0.5) LR = 0.99; and “negative” (TF-IDF < 0.05) LR = 0.05. 82% of reports were classified as either “positive” or “negative”. In the test set, only 4 of 199 (2.0%) reports with a “negative” classification were false negatives and only 8 of 93 (8.6%) reports classified as “positive” were false positives.

Conclusion: NLP can accurately identify IME from free-text reports of head CTs in approximately 80% of records, adequate to allow automatic calculation of MPM based on EHR data for many applications.
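
As a rough illustration of the three-way score categorization and the likelihood ratios described in the abstract, the sketch below recomputes interval likelihood ratios from the test-set counts quoted there. It is not the authors' code. The function names `categorize` and `interval_likelihood_ratios` are hypothetical, treating a score of exactly 0.05 or 0.5 as "inconclusive" is an assumption, and the counts for the "inconclusive" band are not stated in the abstract but derived here by subtraction.

```python
from typing import Dict, Tuple

# Cut-offs quoted in the abstract; handling of the exact boundary values is assumed.
NEGATIVE_BELOW = 0.05
POSITIVE_ABOVE = 0.5


def categorize(score: float) -> str:
    """Map a continuous model score to positive / inconclusive / negative."""
    if score > POSITIVE_ABOVE:
        return "positive"
    if score < NEGATIVE_BELOW:
        return "negative"
    return "inconclusive"


def interval_likelihood_ratios(
    counts: Dict[str, Tuple[int, int]],
) -> Dict[str, float]:
    """Compute LR = P(category | IME) / P(category | no IME) for each category.

    `counts` maps category -> (reports with IME, reports without IME).
    """
    total_ime = sum(with_ime for with_ime, _ in counts.values())
    total_no_ime = sum(without_ime for _, without_ime in counts.values())
    return {
        category: (with_ime / total_ime) / (without_ime / total_no_ime)
        for category, (with_ime, without_ime) in counts.items()
    }


# Test-set counts taken from the abstract: 355 reports, 108 labeled "yes" for IME.
# "positive": 93 reports, 8 false positives  -> 85 with IME, 8 without.
# "negative": 199 reports, 4 false negatives -> 4 with IME, 195 without.
# "inconclusive": derived by subtraction (assumption) -> 19 with IME, 44 without.
test_counts = {
    "positive": (85, 8),
    "inconclusive": (108 - 85 - 4, (355 - 108) - 8 - 195),
    "negative": (4, 195),
}

ratios = interval_likelihood_ratios(test_counts)
conclusive_fraction = (93 + 199) / 355  # share classified positive or negative

if __name__ == "__main__":
    for category, lr in ratios.items():
        print(f"{category}: LR = {lr:.2f}")
    print(f"conclusive: {conclusive_fraction:.1%}")
```

With these counts, the inconclusive and negative ratios round to the reported 0.99 and 0.05, and the conclusive fraction rounds to 82%. The positive ratio comes out near 24.3 rather than the reported 24.59, so the published value may rest on inputs that cannot be fully recovered from the abstract alone.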

Original language	English (US)
Pages (from-to)	388-392
Number of pages	5
Journal	American Journal of Emergency Medicine
Volume	51
DOIs	https://doi.org/10.1016/j.ajem.2021.11.001
State	Published - Jan 2022

Keywords

Artificial intelligence
Emergency critical care
Hospital mortality
Natural language processing

ASJC Scopus subject areas

Emergency Medicine

Access to Document

10.1016/j.ajem.2021.11.001

Cite this

@article{0222b1e47ea14f5bbbe489e377e50266,

title = "Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm",

abstract = "Background: The Mortality Probability Model (MPM) is used in research and quality improvement to adjust for severity of illness and can also inform triage decisions. However, a limitation for its automated use or application is that it includes the variable “intracranial mass effect” (IME), which requires human engagement with the electronic health record (EHR). We developed and tested a natural language processing (NLP) algorithm to identify IME from CT head reports. Methods: We obtained initial CT head reports from adult patients who were admitted to the ICU from our ED between 10/2013 and 9/2016. Each head CT report was labeled yes/no for IME by at least two of five independent labelers. The reports were then randomly divided 80/20 into training and test sets. All reports were preprocessed to remove linguistic and style variability, and a dictionary was created to map similar common terms. We tested three vectorization strategies to convert the report text to a numerical vector: Term Frequency-Inverse Document Frequency (TF-IDF), Word2Vec, and Universal Sentence Encoder. This vector served as the input to a classification-tree-based ensemble machine learning algorithm (XGBoost). After training, model performance was assessed in the test set using the area under the receiver operating characteristic curve (AUROC). We also divided the continuous range of scores into positive/inconclusive/negative categories for IME. Results: Of the 1202 CT reports in the training set, 308 (25.6%) reports were manually labeled as “yes” for IME. Of the 355 reports in the test set, 108 (30.4%) were labeled as “yes” for IME. The TF-IDF vectorization strategy as an input for the XGBoost model had the best AUROC: 0.9625 (95% CI 0.9443–0.9807). TF-IDF score categories were defined and had the following likelihood ratios: “positive” (TF-IDF score > 0.5) LR = 24.59; “inconclusive” (TF-IDF 0.05–0.5) LR = 0.99; and “negative” (TF-IDF < 0.05) LR = 0.05. 82% of reports were classified as either “positive” or “negative”. In the test set, only 4 of 199 (2.0%) reports with a “negative” classification were false negatives and only 8 of 93 (8.6%) reports classified as “positive” were false positives. Conclusion: NLP can accurately identify IME from free-text reports of head CTs in approximately 80% of records, adequate to allow automatic calculation of MPM based on EHR data for many applications.",

keywords = "Artificial intelligence, Emergency critical care, Hospital mortality, Natural language processing",

author = "Gordon, {Alexandra June} and Imon Banerjee and Jason Block and Christopher Winstead-Derlega and Wilson, {Jennifer G.} and Tsuyoshi Mitarai and Michael Jarrett and Josh Sanyal and Rubin, {Daniel L.} and Max Wintermark and Kohn, {Michael A.}",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier Inc.",

year = "2022",

month = jan,

doi = "10.1016/j.ajem.2021.11.001",

language = "English (US)",

volume = "51",

pages = "388--392",

journal = "American Journal of Emergency Medicine",

issn = "0735-6757",

publisher = "W.B. Saunders Ltd",

}

TY - JOUR

T1 - Natural language processing of head CT reports to identify intracranial mass effect

T2 - CTIME algorithm

AU - Gordon, Alexandra June

AU - Banerjee, Imon

AU - Block, Jason

AU - Winstead-Derlega, Christopher

AU - Wilson, Jennifer G.

AU - Mitarai, Tsuyoshi

AU - Jarrett, Michael

AU - Sanyal, Josh

AU - Rubin, Daniel L.

AU - Wintermark, Max

AU - Kohn, Michael A.

PY - 2022/1

Y1 - 2022/1

AB - Background: The Mortality Probability Model (MPM) is used in research and quality improvement to adjust for severity of illness and can also inform triage decisions. However, a limitation for its automated use or application is that it includes the variable “intracranial mass effect” (IME), which requires human engagement with the electronic health record (EHR). We developed and tested a natural language processing (NLP) algorithm to identify IME from CT head reports. Methods: We obtained initial CT head reports from adult patients who were admitted to the ICU from our ED between 10/2013 and 9/2016. Each head CT report was labeled yes/no for IME by at least two of five independent labelers. The reports were then randomly divided 80/20 into training and test sets. All reports were preprocessed to remove linguistic and style variability, and a dictionary was created to map similar common terms. We tested three vectorization strategies to convert the report text to a numerical vector: Term Frequency-Inverse Document Frequency (TF-IDF), Word2Vec, and Universal Sentence Encoder. This vector served as the input to a classification-tree-based ensemble machine learning algorithm (XGBoost). After training, model performance was assessed in the test set using the area under the receiver operating characteristic curve (AUROC). We also divided the continuous range of scores into positive/inconclusive/negative categories for IME. Results: Of the 1202 CT reports in the training set, 308 (25.6%) reports were manually labeled as “yes” for IME. Of the 355 reports in the test set, 108 (30.4%) were labeled as “yes” for IME. The TF-IDF vectorization strategy as an input for the XGBoost model had the best AUROC: 0.9625 (95% CI 0.9443–0.9807). TF-IDF score categories were defined and had the following likelihood ratios: “positive” (TF-IDF score > 0.5) LR = 24.59; “inconclusive” (TF-IDF 0.05–0.5) LR = 0.99; and “negative” (TF-IDF < 0.05) LR = 0.05. 82% of reports were classified as either “positive” or “negative”. In the test set, only 4 of 199 (2.0%) reports with a “negative” classification were false negatives and only 8 of 93 (8.6%) reports classified as “positive” were false positives. Conclusion: NLP can accurately identify IME from free-text reports of head CTs in approximately 80% of records, adequate to allow automatic calculation of MPM based on EHR data for many applications.

KW - Artificial intelligence

KW - Emergency critical care

KW - Hospital mortality

KW - Natural language processing

UR - http://www.scopus.com/inward/record.url?scp=85119905878&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85119905878&partnerID=8YFLogxK

U2 - 10.1016/j.ajem.2021.11.001

DO - 10.1016/j.ajem.2021.11.001

M3 - Article

C2 - 34839182

AN - SCOPUS:85119905878

SN - 0735-6757

VL - 51

SP - 388

EP - 392

JO - American Journal of Emergency Medicine

JF - American Journal of Emergency Medicine

ER -
