Application of a natural language processing algorithm to asthma ascertainment: An automated chart review

Chung Il Wi, Sunghwan Sohn, Mary C. Rolfes, Alicia Seabright, Euijung Ryu, Gretchen Voge, Kay A. Bachman, Miguel A. Park, Hirohito Kita, Ivana T. Croghan, Hongfang Liu, Young J. Juhn

Research output: Contribution to journalArticlepeer-review

29 Scopus citations


Rationale: Difficulty of asthma ascertainment and its associated methodologic heterogeneity have created significant barriers to asthma care and research. Objectives: We evaluated the validity of an existing natural language processing (NLP) algorithm for asthma criteria to enable an automated chart review using electronic medical records (EMRs). Methods: The study was designed as a retrospective birth cohort study using a random sample of 500 subjects from the 1997-2007 Mayo Birth Cohort who were born at Mayo Clinic and enrolled in primary pediatric care at Mayo Clinic Rochester. Performance of NLP-based asthma ascertainment using predetermined asthma criteria was assessed by determining both criterion validity (chart review of EMRs by abstractor as a gold standard) and construct validity (association with known risk factors for asthma, such as allergic rhinitis). Measurements and Main Results: After excluding three subjects whose respiratory symptoms could be attributed to other conditions (e.g., tracheomalacia), among the remaining eligible 497 subjects, 51% were male, 77% white persons, and the median age at last follow-up date was 11.5 years. The asthma prevalence was 31% in the study cohort. Sensitivity, specificity, positive predictive value, and negative predictive value for NLP algorithm in predicting asthma status were 97%, 95%, 90%, and 98%, respectively. The risk factors for asthma (e.g., allergic rhinitis) that were identified either by NLP or the abstractor were the same. Conclusions: Asthma ascertainment through NLP should be considered in the era of EMRs because it can enable large-scale clinical studies in a more time-efficient manner and improve the recognition and care of childhood asthma in practice.

Original languageEnglish (US)
Pages (from-to)430-437
Number of pages8
JournalAmerican journal of respiratory and critical care medicine
Issue number4
StatePublished - Aug 15 2017


  • Electronic medical records
  • Informatics
  • Retrospective study

ASJC Scopus subject areas

  • Pulmonary and Respiratory Medicine
  • Critical Care and Intensive Care Medicine


Dive into the research topics of 'Application of a natural language processing algorithm to asthma ascertainment: An automated chart review'. Together they form a unique fingerprint.

Cite this