Expert artificial intelligence-based natural language processing characterises childhood asthma

Hee Yun Seol; Mary C. Rolfes; Wi Chung; Sunghwan Sohn; Euijung Ryu; Miguel A. Park; Hirohito Kita; Junya Ono; Ivana Croghan; Sebastian M. Armasu; Jose A. Castro-Rodriguez; Jill D. Weston; Hongfang Liu; Young Juhn

doi:10.1136/bmjresp-2019-000524

Expert artificial intelligence-based natural language processing characterises childhood asthma

Hee Yun Seol, Mary C. Rolfes, Wi Chung, Sunghwan Sohn, Euijung Ryu, Miguel A. Park, Hirohito Kita, Junya Ono, Ivana Croghan, Sebastian M. Armasu, Jose A. Castro-Rodriguez, Jill D. Weston, Hongfang Liu, Young Juhn

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

Introduction The lack of effective, consistent, reproducible and efficient asthma ascertainment methods results in inconsistent asthma cohorts and study results for clinical trials or other studies. We aimed to assess whether application of expert artificial intelligence (AI)-based natural language processing (NLP) algorithms for two existing asthma criteria to electronic health records of a paediatric population systematically identifies childhood asthma and its subgroups with distinctive characteristics. Methods Using the 1997-2007 Olmsted County Birth Cohort, we applied validated NLP algorithms for Predetermined Asthma Criteria (NLP-PAC) as well as Asthma Predictive Index (NLP-API). We categorised subjects into four groups (both criteria positive (NLP-PAC + /NLP-API +); PAC positive only (NLP-PAC + only); API positive only (NLP-API + only); and both criteria negative (NLP-PAC - /NLP-API -)) and characterised them. Results were replicated in unsupervised cluster analysis for asthmatics and a random sample of 300 children using laboratory and pulmonary function tests (PFTs). Results Of the 8196 subjects (51% male, 80% white), we identified 1614 (20%), NLP-PAC + /NLP-API +; 954 (12%), NLP-PAC + only; 105 (1%), NLP-API + only; and 5523 (67%), NLP-PAC - /NLP-API -. Asthmatic children classified as NLP-PAC + /NLP-API + showed earlier onset asthma, more Th2-high profile, poorer lung function, higher asthma exacerbation and higher risk of asthma-associated comorbidities compared with other groups. These results were consistent with those based on unsupervised cluster analysis and lab and PFT data of a random sample of study subjects. Conclusion Expert AI-based NLP algorithms for two asthma criteria systematically identify childhood asthma with distinctive characteristics. This approach may improve precision, reproducibility, consistency and efficiency of large-scale clinical studies for asthma and enable population management.

Original language	English (US)
Article number	e000524
Journal	BMJ Open Respiratory Research
Volume	7
Issue number	1
DOIs	https://doi.org/10.1136/bmjresp-2019-000524
State	Published - Feb 4 2020

Keywords

asthma
asthma epidemiology
paediatric asthma

ASJC Scopus subject areas

Pulmonary and Respiratory Medicine

Access to Document

10.1136/bmjresp-2019-000524

Cite this

Seol, H. Y., Rolfes, M. C., Chung, W., Sohn, S., Ryu, E., Park, M. A., Kita, H., Ono, J., Croghan, I., Armasu, S. M., Castro-Rodriguez, J. A., Weston, J. D., Liu, H., & Juhn, Y. (2020). Expert artificial intelligence-based natural language processing characterises childhood asthma. BMJ Open Respiratory Research, 7(1), Article e000524. https://doi.org/10.1136/bmjresp-2019-000524

@article{ba5d7be05e3d4ca99903bb48903b0f6d,

title = "Expert artificial intelligence-based natural language processing characterises childhood asthma",

abstract = "Introduction The lack of effective, consistent, reproducible and efficient asthma ascertainment methods results in inconsistent asthma cohorts and study results for clinical trials or other studies. We aimed to assess whether application of expert artificial intelligence (AI)-based natural language processing (NLP) algorithms for two existing asthma criteria to electronic health records of a paediatric population systematically identifies childhood asthma and its subgroups with distinctive characteristics. Methods Using the 1997-2007 Olmsted County Birth Cohort, we applied validated NLP algorithms for Predetermined Asthma Criteria (NLP-PAC) as well as Asthma Predictive Index (NLP-API). We categorised subjects into four groups (both criteria positive (NLP-PAC + /NLP-API +); PAC positive only (NLP-PAC + only); API positive only (NLP-API + only); and both criteria negative (NLP-PAC - /NLP-API -)) and characterised them. Results were replicated in unsupervised cluster analysis for asthmatics and a random sample of 300 children using laboratory and pulmonary function tests (PFTs). Results Of the 8196 subjects (51% male, 80% white), we identified 1614 (20%), NLP-PAC + /NLP-API +; 954 (12%), NLP-PAC + only; 105 (1%), NLP-API + only; and 5523 (67%), NLP-PAC - /NLP-API -. Asthmatic children classified as NLP-PAC + /NLP-API + showed earlier onset asthma, more Th2-high profile, poorer lung function, higher asthma exacerbation and higher risk of asthma-associated comorbidities compared with other groups. These results were consistent with those based on unsupervised cluster analysis and lab and PFT data of a random sample of study subjects. Conclusion Expert AI-based NLP algorithms for two asthma criteria systematically identify childhood asthma with distinctive characteristics. This approach may improve precision, reproducibility, consistency and efficiency of large-scale clinical studies for asthma and enable population management.",

keywords = "asthma, asthma epidemiology, paediatric asthma",

author = "Seol, {Hee Yun} and Rolfes, {Mary C.} and Wi Chung and Sunghwan Sohn and Euijung Ryu and Park, {Miguel A.} and Hirohito Kita and Junya Ono and Ivana Croghan and Armasu, {Sebastian M.} and Castro-Rodriguez, {Jose A.} and Weston, {Jill D.} and Hongfang Liu and Young Juhn",

note = "Publisher Copyright: {\textcopyright} Author (s).",

year = "2020",

month = feb,

day = "4",

doi = "10.1136/bmjresp-2019-000524",

language = "English (US)",

volume = "7",

journal = "BMJ Open Respiratory Research",

issn = "2052-4439",

publisher = "BMJ Publishing Group",

number = "1",

}

TY - JOUR

T1 - Expert artificial intelligence-based natural language processing characterises childhood asthma

AU - Seol, Hee Yun

AU - Rolfes, Mary C.

AU - Chung, Wi

AU - Sohn, Sunghwan

AU - Ryu, Euijung

AU - Park, Miguel A.

AU - Kita, Hirohito

AU - Ono, Junya

AU - Croghan, Ivana

AU - Armasu, Sebastian M.

AU - Castro-Rodriguez, Jose A.

AU - Weston, Jill D.

AU - Liu, Hongfang

AU - Juhn, Young

PY - 2020/2/4

Y1 - 2020/2/4

N2 - Introduction The lack of effective, consistent, reproducible and efficient asthma ascertainment methods results in inconsistent asthma cohorts and study results for clinical trials or other studies. We aimed to assess whether application of expert artificial intelligence (AI)-based natural language processing (NLP) algorithms for two existing asthma criteria to electronic health records of a paediatric population systematically identifies childhood asthma and its subgroups with distinctive characteristics. Methods Using the 1997-2007 Olmsted County Birth Cohort, we applied validated NLP algorithms for Predetermined Asthma Criteria (NLP-PAC) as well as Asthma Predictive Index (NLP-API). We categorised subjects into four groups (both criteria positive (NLP-PAC + /NLP-API +); PAC positive only (NLP-PAC + only); API positive only (NLP-API + only); and both criteria negative (NLP-PAC - /NLP-API -)) and characterised them. Results were replicated in unsupervised cluster analysis for asthmatics and a random sample of 300 children using laboratory and pulmonary function tests (PFTs). Results Of the 8196 subjects (51% male, 80% white), we identified 1614 (20%), NLP-PAC + /NLP-API +; 954 (12%), NLP-PAC + only; 105 (1%), NLP-API + only; and 5523 (67%), NLP-PAC - /NLP-API -. Asthmatic children classified as NLP-PAC + /NLP-API + showed earlier onset asthma, more Th2-high profile, poorer lung function, higher asthma exacerbation and higher risk of asthma-associated comorbidities compared with other groups. These results were consistent with those based on unsupervised cluster analysis and lab and PFT data of a random sample of study subjects. Conclusion Expert AI-based NLP algorithms for two asthma criteria systematically identify childhood asthma with distinctive characteristics. This approach may improve precision, reproducibility, consistency and efficiency of large-scale clinical studies for asthma and enable population management.

AB - Introduction The lack of effective, consistent, reproducible and efficient asthma ascertainment methods results in inconsistent asthma cohorts and study results for clinical trials or other studies. We aimed to assess whether application of expert artificial intelligence (AI)-based natural language processing (NLP) algorithms for two existing asthma criteria to electronic health records of a paediatric population systematically identifies childhood asthma and its subgroups with distinctive characteristics. Methods Using the 1997-2007 Olmsted County Birth Cohort, we applied validated NLP algorithms for Predetermined Asthma Criteria (NLP-PAC) as well as Asthma Predictive Index (NLP-API). We categorised subjects into four groups (both criteria positive (NLP-PAC + /NLP-API +); PAC positive only (NLP-PAC + only); API positive only (NLP-API + only); and both criteria negative (NLP-PAC - /NLP-API -)) and characterised them. Results were replicated in unsupervised cluster analysis for asthmatics and a random sample of 300 children using laboratory and pulmonary function tests (PFTs). Results Of the 8196 subjects (51% male, 80% white), we identified 1614 (20%), NLP-PAC + /NLP-API +; 954 (12%), NLP-PAC + only; 105 (1%), NLP-API + only; and 5523 (67%), NLP-PAC - /NLP-API -. Asthmatic children classified as NLP-PAC + /NLP-API + showed earlier onset asthma, more Th2-high profile, poorer lung function, higher asthma exacerbation and higher risk of asthma-associated comorbidities compared with other groups. These results were consistent with those based on unsupervised cluster analysis and lab and PFT data of a random sample of study subjects. Conclusion Expert AI-based NLP algorithms for two asthma criteria systematically identify childhood asthma with distinctive characteristics. This approach may improve precision, reproducibility, consistency and efficiency of large-scale clinical studies for asthma and enable population management.

KW - asthma

KW - asthma epidemiology

KW - paediatric asthma

UR - http://www.scopus.com/inward/record.url?scp=85078661728&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85078661728&partnerID=8YFLogxK

U2 - 10.1136/bmjresp-2019-000524

DO - 10.1136/bmjresp-2019-000524

M3 - Article

C2 - 33371009

AN - SCOPUS:85078661728

SN - 2052-4439

VL - 7

JO - BMJ Open Respiratory Research

JF - BMJ Open Respiratory Research

IS - 1

M1 - e000524

ER -

Expert artificial intelligence-based natural language processing characterises childhood asthma

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this