TY - JOUR
T1 - Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification
AU - Banerjee, Imon
AU - Ling, Yuan
AU - Chen, Matthew C.
AU - Hasan, Sadid A.
AU - Langlotz, Curtis P.
AU - Moradzadeh, Nathaniel
AU - Chapman, Brian
AU - Amrhein, Timothy
AU - Mong, David
AU - Rubin, Daniel L.
AU - Farri, Oladimeji
AU - Lungren, Matthew P.
N1 - Funding Information:
Financial support for this project was provided by grants from Philips Healthcare , Stanford Child Health Research Institute (Stanford NIH-NCATS-CTSA Grant #UL1 TR001085 ) and National Library of Medicine #R01LM01296601 .
Publisher Copyright:
© 2018 Elsevier B.V.
PY - 2019/6
Y1 - 2019/6
N2 - This paper explores cutting-edge deep learning methods for information extraction from medical imaging free text reports at a multi-institutional scale and compares them to the state-of-the-art domain-specific rule-based system – PEFinder and traditional machine learning methods – SVM and Adaboost. We proposed two distinct deep learning models – (i) CNN Word – Glove, and (ii) Domain phrase attention-based hierarchical recurrent neural network (DPA-HNN), for synthesizing information on pulmonary emboli (PE) from over 7370 clinical thoracic computed tomography (CT) free-text radiology reports collected from four major healthcare centers. Our proposed DPA-HNN model encodes domain-dependent phrases into an attention mechanism and represents a radiology report through a hierarchical RNN structure composed of word-level, sentence-level and document-level representations. Experimental results suggest that the performance of the deep learning models that are trained on a single institutional dataset, are better than rule-based PEFinder on our multi-institutional test sets. The best F1 score for the presence of PE in an adult patient population was 0.99 (DPA-HNN) and for a pediatrics population was 0.99 (HNN) which shows that the deep learning models being trained on adult data, demonstrated generalizability to pediatrics population with comparable accuracy. Our work suggests feasibility of broader usage of neural network models in automated classification of multi-institutional imaging text reports for a variety of applications including evaluation of imaging utilization, imaging yield, clinical decision support tools, and as part of automated classification of large corpus for medical imaging deep learning work.
AB - This paper explores cutting-edge deep learning methods for information extraction from medical imaging free text reports at a multi-institutional scale and compares them to the state-of-the-art domain-specific rule-based system – PEFinder and traditional machine learning methods – SVM and Adaboost. We proposed two distinct deep learning models – (i) CNN Word – Glove, and (ii) Domain phrase attention-based hierarchical recurrent neural network (DPA-HNN), for synthesizing information on pulmonary emboli (PE) from over 7370 clinical thoracic computed tomography (CT) free-text radiology reports collected from four major healthcare centers. Our proposed DPA-HNN model encodes domain-dependent phrases into an attention mechanism and represents a radiology report through a hierarchical RNN structure composed of word-level, sentence-level and document-level representations. Experimental results suggest that the performance of the deep learning models that are trained on a single institutional dataset, are better than rule-based PEFinder on our multi-institutional test sets. The best F1 score for the presence of PE in an adult patient population was 0.99 (DPA-HNN) and for a pediatrics population was 0.99 (HNN) which shows that the deep learning models being trained on adult data, demonstrated generalizability to pediatrics population with comparable accuracy. Our work suggests feasibility of broader usage of neural network models in automated classification of multi-institutional imaging text reports for a variety of applications including evaluation of imaging utilization, imaging yield, clinical decision support tools, and as part of automated classification of large corpus for medical imaging deep learning work.
KW - Convolutional neural network (CNN)
KW - Pulmonary embolism
KW - Radiology report analysis
KW - Recurrent neural network (RNN)
KW - Text report classification
UR - http://www.scopus.com/inward/record.url?scp=85057000826&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057000826&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2018.11.004
DO - 10.1016/j.artmed.2018.11.004
M3 - Article
C2 - 30477892
AN - SCOPUS:85057000826
SN - 0933-3657
VL - 97
SP - 79
EP - 88
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
ER -