Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification

Imon Banerjee; Yuan Ling; Matthew C. Chen; Sadid A. Hasan; Curtis P. Langlotz; Nathaniel Moradzadeh; Brian Chapman; Timothy Amrhein; David Mong; Daniel L. Rubin; Oladimeji Farri; Matthew P. Lungren

doi:10.1016/j.artmed.2018.11.004

Diagnostic Radiology

Research output: Contribution to journal › Article › peer-review

Abstract

This paper explores deep learning methods for information extraction from free-text medical imaging reports at a multi-institutional scale and compares them with a state-of-the-art domain-specific rule-based system (PEFinder) and with traditional machine learning methods (SVM and AdaBoost). We propose two distinct deep learning models, (i) CNN Word-Glove and (ii) a domain phrase attention-based hierarchical recurrent neural network (DPA-HNN), for synthesizing information on pulmonary emboli (PE) from over 7370 clinical thoracic computed tomography (CT) free-text radiology reports collected from four major healthcare centers. The proposed DPA-HNN model encodes domain-dependent phrases into an attention mechanism and represents a radiology report through a hierarchical RNN structure composed of word-level, sentence-level, and document-level representations. Experimental results suggest that the deep learning models trained on a single-institution dataset outperform the rule-based PEFinder on our multi-institutional test sets. The best F1 score for the presence of PE was 0.99 in an adult patient population (DPA-HNN) and 0.99 in a pediatric population (HNN), indicating that deep learning models trained on adult data generalize to a pediatric population with comparable accuracy. Our work suggests the feasibility of broader use of neural network models for automated classification of multi-institutional imaging text reports in a variety of applications, including evaluation of imaging utilization, imaging yield, and clinical decision support tools, and as part of automated classification of large corpora for medical imaging deep learning work.
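
The word-level, sentence-level, and document-level hierarchy described in the abstract can be illustrated with a minimal sketch. This is not the authors' DPA-HNN: it omits the recurrent encoders and the domain-phrase component, and simply shows attention-weighted pooling applied twice, first over words and then over sentences. The function names, the context vectors, and the toy two-dimensional embeddings are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, context):
    """Pool vectors into one vector using attention weights.

    Each weight is softmax(dot(vector, context)); the result is the
    weighted sum of the input vectors. `context` stands in for a learned
    attention vector (an assumption for illustration).
    """
    scores = [sum(v_i * c_i for v_i, c_i in zip(v, context)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


def encode_report(report, word_context, sentence_context):
    """Words -> sentence vectors -> one document vector."""
    sentence_vectors = [attention_pool(words, word_context)[0] for words in report]
    return attention_pool(sentence_vectors, sentence_context)


# Toy report: two sentences made of hypothetical 2-d word embeddings.
report = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5]],
]
doc_vector, sentence_weights = encode_report(report, [1.0, 0.0], [0.0, 1.0])
```

In a real classifier the document vector would feed a final layer that predicts the PE label, and the sentence weights would show which sentences contributed most to that prediction.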

Original language	English (US)
Pages (from-to)	79-88
Number of pages	10
Journal	Artificial Intelligence in Medicine
Volume	97
DOIs	https://doi.org/10.1016/j.artmed.2018.11.004
State	Published - Jun 2019

Keywords

Convolutional neural network (CNN)
Pulmonary embolism
Radiology report analysis
Recurrent neural network (RNN)
Text report classification

ASJC Scopus subject areas

Medicine (miscellaneous)
Artificial Intelligence

Cite this

Banerjee, I., Ling, Y., Chen, M. C., Hasan, S. A., Langlotz, C. P., Moradzadeh, N., Chapman, B., Amrhein, T., Mong, D., Rubin, D. L., Farri, O., & Lungren, M. P. (2019). Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification. Artificial Intelligence in Medicine, 97, 79-88. https://doi.org/10.1016/j.artmed.2018.11.004

Banerjee, I, Ling, Y, Chen, MC, Hasan, SA, Langlotz, CP, Moradzadeh, N, Chapman, B, Amrhein, T, Mong, D, Rubin, DL, Farri, O & Lungren, MP 2019, 'Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification', Artificial Intelligence in Medicine, vol. 97, pp. 79-88. https://doi.org/10.1016/j.artmed.2018.11.004

@article{7f61fd49c78744d099de7885f015aa9d,
title = "Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification",

abstract = "This paper explores cutting-edge deep learning methods for information extraction from medical imaging free text reports at a multi-institutional scale and compares them to the state-of-the-art domain-specific rule-based system – PEFinder and traditional machine learning methods – SVM and Adaboost. We proposed two distinct deep learning models – (i) CNN Word – Glove, and (ii) Domain phrase attention-based hierarchical recurrent neural network (DPA-HNN), for synthesizing information on pulmonary emboli (PE) from over 7370 clinical thoracic computed tomography (CT) free-text radiology reports collected from four major healthcare centers. Our proposed DPA-HNN model encodes domain-dependent phrases into an attention mechanism and represents a radiology report through a hierarchical RNN structure composed of word-level, sentence-level and document-level representations. Experimental results suggest that the performance of the deep learning models that are trained on a single institutional dataset, are better than rule-based PEFinder on our multi-institutional test sets. The best F1 score for the presence of PE in an adult patient population was 0.99 (DPA-HNN) and for a pediatrics population was 0.99 (HNN) which shows that the deep learning models being trained on adult data, demonstrated generalizability to pediatrics population with comparable accuracy. Our work suggests feasibility of broader usage of neural network models in automated classification of multi-institutional imaging text reports for a variety of applications including evaluation of imaging utilization, imaging yield, clinical decision support tools, and as part of automated classification of large corpus for medical imaging deep learning work.",

keywords = "Convolutional neural network (CNN), Pulmonary embolism, Radiology report analysis, Recurrent neural network (RNN), Text report classification",
author = "Banerjee, Imon and Ling, Yuan and Chen, {Matthew C.} and Hasan, {Sadid A.} and Langlotz, {Curtis P.} and Moradzadeh, Nathaniel and Chapman, Brian and Amrhein, Timothy and Mong, David and Rubin, {Daniel L.} and Farri, Oladimeji and Lungren, {Matthew P.}",
note = "Publisher Copyright: {\textcopyright} 2018 Elsevier B.V.",
year = "2019",
month = jun,
doi = "10.1016/j.artmed.2018.11.004",
language = "English (US)",
volume = "97",
pages = "79--88",
journal = "Artificial Intelligence in Medicine",
issn = "0933-3657",
publisher = "Elsevier",
}

TY - JOUR
T1 - Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification
AU - Banerjee, Imon
AU - Ling, Yuan
AU - Chen, Matthew C.
AU - Hasan, Sadid A.
AU - Langlotz, Curtis P.
AU - Moradzadeh, Nathaniel
AU - Chapman, Brian
AU - Amrhein, Timothy
AU - Mong, David
AU - Rubin, Daniel L.
AU - Farri, Oladimeji
AU - Lungren, Matthew P.
PY - 2019/6
Y1 - 2019/6

N2 - This paper explores cutting-edge deep learning methods for information extraction from medical imaging free text reports at a multi-institutional scale and compares them to the state-of-the-art domain-specific rule-based system – PEFinder and traditional machine learning methods – SVM and Adaboost. We proposed two distinct deep learning models – (i) CNN Word – Glove, and (ii) Domain phrase attention-based hierarchical recurrent neural network (DPA-HNN), for synthesizing information on pulmonary emboli (PE) from over 7370 clinical thoracic computed tomography (CT) free-text radiology reports collected from four major healthcare centers. Our proposed DPA-HNN model encodes domain-dependent phrases into an attention mechanism and represents a radiology report through a hierarchical RNN structure composed of word-level, sentence-level and document-level representations. Experimental results suggest that the performance of the deep learning models that are trained on a single institutional dataset, are better than rule-based PEFinder on our multi-institutional test sets. The best F1 score for the presence of PE in an adult patient population was 0.99 (DPA-HNN) and for a pediatrics population was 0.99 (HNN) which shows that the deep learning models being trained on adult data, demonstrated generalizability to pediatrics population with comparable accuracy. Our work suggests feasibility of broader usage of neural network models in automated classification of multi-institutional imaging text reports for a variety of applications including evaluation of imaging utilization, imaging yield, clinical decision support tools, and as part of automated classification of large corpus for medical imaging deep learning work.

KW - Convolutional neural network (CNN)
KW - Pulmonary embolism
KW - Radiology report analysis
KW - Recurrent neural network (RNN)
KW - Text report classification
UR - http://www.scopus.com/inward/record.url?scp=85057000826&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057000826&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2018.11.004
DO - 10.1016/j.artmed.2018.11.004
M3 - Article
C2 - 30477892
AN - SCOPUS:85057000826
SN - 0933-3657
VL - 97
SP - 79
EP - 88
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
ER -
