TY - JOUR
T1 - Development and Performance of the Pulmonary Embolism Result Forecast Model (PERFORM) for Computed Tomography Clinical Decision Support
AU - Banerjee, Imon
AU - Sofela, Miji
AU - Yang, Jaden
AU - Chen, Jonathan H.
AU - Shah, Nigam H.
AU - Ball, Robyn
AU - Mushlin, Alvin I.
AU - Desai, Manisha
AU - Bledsoe, Joseph
AU - Amrhein, Timothy
AU - Rubin, Daniel L.
AU - Zamanian, Roham
AU - Lungren, Matthew P.
N1 - Funding Information:
Research reported in this publication was supported by the National Library of Medicine of the NIH under award R01LM012966, Stanford Child Health Research Institute (Stanford NIH-National Center for Advancing Translational Sciences-Clinical and Translational Science Awards grant UL1 TR001085), and Philips Healthcare. This research used data or services provided by STARR, "STAnford medicine Research data Repository," made possible by Stanford School of Medicine Research Office. This research used data and services provided by Duke Health's Analytics Center of Excellence (ACE)-Research Customer Solution Team. The Analytics Center of Excellence, is responsible for Duke University Health System's Data Warehouse and Engineering, Business Intelligence, Clinical and Operational Reporting, and Analytics Support for Clinical and Translational Research.
Funding Information:
Conflict of Interest Disclosures: Dr Ball reported receiving grants from the National Institutes of Health (NIH) during the conduct of the study and personal fees from Roam Analytics outside the submitted work. Dr Desai reported receiving grants from the NIH during the conduct of the study. Dr Bledsoe reported receiving personal fees from AMAG Pharmaceuticals and VendRx outside the submitted work. Dr Lungren reported receiving grants from the NIH National Library of Medicine and Philips Healthcare during the conduct of the study. No other disclosures were reported.
Funding Information:
Funding/Support: Research reported in this publication was supported by the National Library of Medicine of the NIH under award R01LM012966, Stanford Child Health Research Institute (Stanford NIH-National Center for Advancing Translational Sciences-Clinical and Translational Science Awards grant UL1 TR001085), and Philips Healthcare. This research used data or services provided by STARR, “STAnford medicine Research data Repository,” made possible by Stanford School of Medicine Research Office. This research used data and services provided by Duke Health’s Analytics Center of Excellence (ACE)-Research Customer Solution Team. The Analytics Center of Excellence, is responsible for Duke University Health System’s Data Warehouse and Engineering, Business Intelligence, Clinical and Operational Reporting, and Analytics Support for Clinical and Translational Research.
Publisher Copyright:
© 2019 Elwenspoek MMC et al. JAMA Network Open.
PY - 2019/8/7
Y1 - 2019/8/7
N2 - Importance: Pulmonary embolism (PE) is a life-threatening clinical problem, and computed tomographic imaging is the standard for diagnosis. Clinical decision support rules based on PE risk-scoring models have been developed to compute pretest probability but are underused and tend to underperform in practice, leading to persistent overuse of CT imaging for PE. Objective: To develop a machine learning model to generate a patient-specific risk score for PE by analyzing longitudinal clinical data as clinical decision support for patients referred for CT imaging for PE. Design, Setting, and Participants: In this diagnostic study, the proposed workflow for the machine learning model, the Pulmonary Embolism Result Forecast Model (PERFORM), transforms raw electronic medical record (EMR) data into temporal feature vectors and develops a decision analytical model targeted toward adult patients referred for CT imaging for PE. The model was tested on holdout patient EMR data from 2 large, academic medical practices. A total of 3397 annotated CT imaging examinations for PE from 3214 unique patients seen at Stanford University hospitals and clinics were used for training and validation. The models were externally validated on 240 unique patients seen at Duke University Medical Center. The comparison with clinical scoring systems was done on randomly selected 100 outpatient samples from Stanford University hospitals and clinics and 101 outpatient samples from Duke University Medical Center. Main Outcomes and Measures: Prediction performance of diagnosing acute PE was evaluated using ElasticNet, artificial neural networks, and other machine learning approaches on holdout data sets from both institutions, and performance of models was measured by area under the receiver operating characteristic curve (AUROC). Results: Of the 3214 patients included in the study, 1704 (53.0%) were women from Stanford University hospitals and clinics; mean (SD) age was 60.53 (19.43) years. The 240 patients from Duke University Medical Center used for validation included 132 women (55.0%); mean (SD) age was 70.2 (14.2) years. In the samples for clinical scoring system comparisons, the 100 outpatients from Stanford University hospitals and clinics included 67 women (67.0%); mean (SD) age was 57.74 (19.87) years, and the 101 patients from Duke University Medical Center included 59 women (58.4%); mean (SD) age was 73.06 (15.3) years. The best-performing model achieved an AUROC performance of predicting a positive PE study of 0.90 (95% CI, 0.87-0.91) on intrainstitutional holdout data with an AUROC of 0.71 (95% CI, 0.69-0.72) on an external data set from Duke University Medical Center; superior AUROC performance and cross-institutional generalization of the model of 0.81 (95% CI, 0.77-0.87) and 0.81 (95% CI, 0.73-0.82), respectively, were noted on holdout outpatient populations from both intrainstitutional and extrainstitutional data. Conclusions and Relevance: The machine learning model, PERFORM, may consider multitudes of applicable patient-specific risk factors and dependencies to arrive at a PE risk prediction that generalizes to new population distributions. This approach might be used as an automated clinical decision-support tool for patients referred for CT PE imaging to improve CT use.
AB - Importance: Pulmonary embolism (PE) is a life-threatening clinical problem, and computed tomographic imaging is the standard for diagnosis. Clinical decision support rules based on PE risk-scoring models have been developed to compute pretest probability but are underused and tend to underperform in practice, leading to persistent overuse of CT imaging for PE. Objective: To develop a machine learning model to generate a patient-specific risk score for PE by analyzing longitudinal clinical data as clinical decision support for patients referred for CT imaging for PE. Design, Setting, and Participants: In this diagnostic study, the proposed workflow for the machine learning model, the Pulmonary Embolism Result Forecast Model (PERFORM), transforms raw electronic medical record (EMR) data into temporal feature vectors and develops a decision analytical model targeted toward adult patients referred for CT imaging for PE. The model was tested on holdout patient EMR data from 2 large, academic medical practices. A total of 3397 annotated CT imaging examinations for PE from 3214 unique patients seen at Stanford University hospitals and clinics were used for training and validation. The models were externally validated on 240 unique patients seen at Duke University Medical Center. The comparison with clinical scoring systems was done on randomly selected 100 outpatient samples from Stanford University hospitals and clinics and 101 outpatient samples from Duke University Medical Center. Main Outcomes and Measures: Prediction performance of diagnosing acute PE was evaluated using ElasticNet, artificial neural networks, and other machine learning approaches on holdout data sets from both institutions, and performance of models was measured by area under the receiver operating characteristic curve (AUROC). Results: Of the 3214 patients included in the study, 1704 (53.0%) were women from Stanford University hospitals and clinics; mean (SD) age was 60.53 (19.43) years. The 240 patients from Duke University Medical Center used for validation included 132 women (55.0%); mean (SD) age was 70.2 (14.2) years. In the samples for clinical scoring system comparisons, the 100 outpatients from Stanford University hospitals and clinics included 67 women (67.0%); mean (SD) age was 57.74 (19.87) years, and the 101 patients from Duke University Medical Center included 59 women (58.4%); mean (SD) age was 73.06 (15.3) years. The best-performing model achieved an AUROC performance of predicting a positive PE study of 0.90 (95% CI, 0.87-0.91) on intrainstitutional holdout data with an AUROC of 0.71 (95% CI, 0.69-0.72) on an external data set from Duke University Medical Center; superior AUROC performance and cross-institutional generalization of the model of 0.81 (95% CI, 0.77-0.87) and 0.81 (95% CI, 0.73-0.82), respectively, were noted on holdout outpatient populations from both intrainstitutional and extrainstitutional data. Conclusions and Relevance: The machine learning model, PERFORM, may consider multitudes of applicable patient-specific risk factors and dependencies to arrive at a PE risk prediction that generalizes to new population distributions. This approach might be used as an automated clinical decision-support tool for patients referred for CT PE imaging to improve CT use.
UR - http://www.scopus.com/inward/record.url?scp=85070545629&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85070545629&partnerID=8YFLogxK
U2 - 10.1001/jamanetworkopen.2019.8719
DO - 10.1001/jamanetworkopen.2019.8719
M3 - Article
C2 - 31390040
AN - SCOPUS:85070545629
SN - 2574-3805
VL - 2
JO - JAMA Network Open
JF - JAMA Network Open
IS - 8
M1 - e198719
ER -