Automated chart review utilizing natural language processing algorithm for asthma predictive index

Harsheen Kaur, Sunghwan Sohn, Chung Il Wi, Euijung Ryu, Miguel A. Park, Kay Bachman, Hirohito Kita, Ivana Croghan, Jose A. Castro-Rodriguez, Gretchen A. Voge, Hongfang Liu, Young J. Juhn

Research output: Contribution to journal › Article › peer-review

14 Scopus citations


Background: No algorithm has yet been developed to automatically identify patients who meet Asthma Predictive Index (API) criteria from electronic health records (EHR). Our objective is to develop and validate a natural language processing (NLP) algorithm to identify patients who meet API criteria.
Methods: This is a cross-sectional study nested in a birth cohort study in Olmsted County, MN. Asthma status ascertained by manual chart review based on API criteria served as the gold standard. NLP-API was developed on a training cohort (n=87) and validated on a test cohort (n=427). Criterion validity was measured by sensitivity, specificity, positive predictive value, and negative predictive value of the NLP algorithm against manual chart review for asthma status. Construct validity was determined by associations of asthma status defined by NLP-API with known risk factors for asthma.
Results: Among the 427 eligible subjects of the test cohort, 48% were male and 74% were White. Median age was 5.3 years (interquartile range 3.6-6.8). Thirty-five (8%) had a history of asthma by NLP-API vs. 36 (8%) by abstractor, with 31 identified by both approaches. NLP-API predicted asthma status with sensitivity 86%, specificity 98%, positive predictive value 88%, and negative predictive value 98%. Asthma status by both NLP and manual chart review was significantly associated with known asthma risk factors, such as history of allergic rhinitis, eczema, family history of asthma, and maternal history of smoking during pregnancy (p value <0.05). Maternal smoking was associated with asthma status determined by both NLP-API and abstractor, with similar effect sizes (odds ratio 4.4 [95% confidence interval 1.8-10.7] for NLP-API vs. 4.2 for abstractor).
Conclusion: NLP-API was able to ascertain asthma status in children by mining the EHR and has the potential to enhance asthma care and research through population management and large-scale studies by identifying children who meet API criteria.
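
The criterion validity measures named in the Methods can be illustrated with a short sketch. It is not the authors' code. It assumes the 2x2 table can be reconstructed from the counts given in the Results (427 subjects, 35 positive by NLP-API, 36 positive by abstractor, 31 positive by both); the class and function names are hypothetical. The values it computes may differ by about a percentage point from the reported figures, which could reflect rounding or details not stated in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Cells of a 2x2 table: test (NLP-API) vs. reference (manual chart review)."""
    tp: int  # positive by both
    fp: int  # positive by test only
    fn: int  # positive by reference only
    tn: int  # negative by both


def counts_from_marginals(total: int, test_pos: int, ref_pos: int, both_pos: int) -> ConfusionCounts:
    """Derive the four cells from the totals reported in the abstract (an assumption)."""
    tp = both_pos
    fp = test_pos - both_pos
    fn = ref_pos - both_pos
    tn = total - tp - fp - fn
    return ConfusionCounts(tp, fp, fn, tn)


def validity_metrics(c: ConfusionCounts) -> dict:
    """Standard definitions of the four criterion validity measures."""
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
    }


# Counts taken from the Results; the cell values are derived, not reported.
counts = counts_from_marginals(total=427, test_pos=35, ref_pos=36, both_pos=31)
metrics = validity_metrics(counts)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under these assumptions the derived table has 31 true positives, 4 false positives, 5 false negatives, and 387 true negatives.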

Original language: English (US)
Article number: 34
Journal: BMC Pulmonary Medicine
Issue number: 1
State: Published - Feb 13 2018


  • API
  • Asthma
  • Epidemiology
  • Informatics
  • NLP

ASJC Scopus subject areas

  • Pulmonary and Respiratory Medicine

