Quantitative CT and machine learning classification of fibrotic interstitial lung diseases

Chi Wan Koo; James M. Williams; Grace Liu; Ananya Panda; Parth P. Patel; Livia Maria M. Frota Lima; Ronald A. Karwoski; Teng Moua; Nicholas B. Larson; Alex Bratt

doi:10.1007/s00330-022-08875-4

Research output: Contribution to journal › Article › peer-review

Abstract

Objectives: To evaluate quantitative computed tomography (QCT) features and QCT feature-based machine learning (ML) models in classifying interstitial lung diseases (ILDs). To compare QCT-ML and deep learning (DL) models’ performance.

Methods: We retrospectively identified 1085 patients with pathologically proven usual interstitial pneumonitis (UIP), nonspecific interstitial pneumonitis (NSIP), and chronic hypersensitivity pneumonitis (CHP) who underwent peri-biopsy chest CT. Kruskal-Wallis test evaluated QCT feature associations with each ILD. QCT features, patient demographics, and pulmonary function test (PFT) results trained eXtreme Gradient Boosting (training/validation set n = 911) yielding 3 models: M1 = QCT features only; M2 = M1 plus age and sex; M3 = M2 plus PFT results. A DL model was also developed. ML and DL model areas under the receiver operating characteristic curve (AUC) and 95% confidence intervals (CIs) were compared for multiclass (UIP vs. NSIP vs. CHP) and binary (UIP vs. non-UIP) classification performances.

Results: The majority (69/78 [88%]) of QCT features successfully differentiated the 3 ILDs (adjusted p ≤ 0.05). All QCT-ML models achieved higher AUC than the DL model (multiclass AUC micro-averages 0.910, 0.910, 0.925, and 0.798 and macro-averages 0.895, 0.893, 0.925, and 0.779 for M1, M2, M3, and DL respectively; binary AUC 0.880, 0.899, 0.898, and 0.869 for M1, M2, M3, and DL respectively). M3 demonstrated statistically significant better performance compared to M2 (∆AUC: 0.015, CI: [0.002, 0.029]) for multiclass prediction.

Conclusions: QCT features successfully differentiated pathologically proven UIP, NSIP, and CHP. While QCT-based ML models outperformed a DL model for classifying ILDs, further investigations are warranted to determine if QCT-ML, DL, or a combination will be superior in ILD classification.

Key Points:
• Quantitative CT features successfully differentiated pathologically proven UIP, NSIP, and CHP.
• Our quantitative CT-based machine learning models demonstrated high performance in classifying UIP, NSIP, and CHP histopathology, outperforming a deep learning model.
• While our quantitative CT-based machine learning models performed better than a DL model, additional investigations are needed to determine whether either or a combination of both approaches delivers superior diagnostic performance.
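
The abstract reports both micro- and macro-averaged multiclass AUCs. As a reading aid, the sketch below shows one common way the two averages can differ, using one-vs-rest AUCs. It is not the authors' code: the abstract does not state how the averages were computed, and the six cases and their predicted probabilities are hypothetical values made up for illustration.

```python
from typing import Dict, List, Sequence


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the chance that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def multiclass_auc(
    y_true: Sequence[str],
    probs: Sequence[Sequence[float]],
    classes: Sequence[str],
) -> Dict[str, float]:
    """One-vs-rest AUC with macro and micro averaging."""
    per_class: List[float] = []
    pooled_labels: List[int] = []
    pooled_scores: List[float] = []
    for k, name in enumerate(classes):
        labels = [1 if y == name else 0 for y in y_true]
        scores = [row[k] for row in probs]
        per_class.append(binary_auc(labels, scores))
        # Micro-averaging pools every (case, class) pair into one binary problem.
        pooled_labels.extend(labels)
        pooled_scores.extend(scores)
    return {
        # Macro-averaging: unweighted mean of the per-class AUCs.
        "macro": sum(per_class) / len(per_class),
        "micro": binary_auc(pooled_labels, pooled_scores),
    }


# Hypothetical toy data: six cases, predicted probabilities for (UIP, NSIP, CHP).
classes = ["UIP", "NSIP", "CHP"]
y_true = ["UIP", "UIP", "NSIP", "NSIP", "CHP", "CHP"]
probs = [
    [0.7, 0.2, 0.1],
    [0.4, 0.5, 0.1],
    [0.2, 0.6, 0.2],
    [0.3, 0.3, 0.4],
    [0.1, 0.2, 0.7],
    [0.2, 0.3, 0.5],
]

result = multiclass_auc(y_true, probs, classes)
print(result)
```

Macro-averaging gives each class equal weight regardless of how many cases it has, while micro-averaging weights every case-class decision equally, so the two can diverge when class sizes or per-class performance are uneven.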

Original language	English (US)
Pages (from-to)	8152-8161
Number of pages	10
Journal	European Radiology
Volume	32
Issue number	12
DOIs	https://doi.org/10.1007/s00330-022-08875-4
State	Published - Dec 2022

Keywords

Chronic hypersensitivity pneumonitis
Interstitial lung disease
Machine learning
Nonspecific interstitial pneumonitis
Usual interstitial pneumonitis

ASJC Scopus subject areas

Radiology, Nuclear Medicine and Imaging

Cite this

@article{055e470823f14ee2a928c8876b6085fe,

title = "Quantitative CT and machine learning classification of fibrotic interstitial lung diseases",

keywords = "Chronic hypersensitivity pneumonitis, Interstitial lung disease, Machine learning, Nonspecific interstitial pneumonitis, Usual interstitial pneumonitis",

author = "Koo, {Chi Wan} and Williams, {James M.} and Grace Liu and Ananya Panda and Patel, {Parth P.} and {Frota Lima}, {Livia Maria M.} and Karwoski, {Ronald A.} and Teng Moua and Larson, {Nicholas B.} and Alex Bratt",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s), under exclusive licence to European Society of Radiology.",

year = "2022",

month = dec,

doi = "10.1007/s00330-022-08875-4",

language = "English (US)",

volume = "32",

pages = "8152--8161",

journal = "European radiology",

issn = "0938-7994",

publisher = "Springer Verlag",

number = "12",

}

TY - JOUR

T1 - Quantitative CT and machine learning classification of fibrotic interstitial lung diseases

AU - Koo, Chi Wan

AU - Williams, James M.

AU - Liu, Grace

AU - Panda, Ananya

AU - Patel, Parth P.

AU - Frota Lima, Livia Maria M.

AU - Karwoski, Ronald A.

AU - Moua, Teng

AU - Larson, Nicholas B.

AU - Bratt, Alex

PY - 2022/12

Y1 - 2022/12

KW - Chronic hypersensitivity pneumonitis

KW - Interstitial lung disease

KW - Machine learning

KW - Nonspecific interstitial pneumonitis

KW - Usual interstitial pneumonitis

UR - http://www.scopus.com/inward/record.url?scp=85131583232&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85131583232&partnerID=8YFLogxK

U2 - 10.1007/s00330-022-08875-4

DO - 10.1007/s00330-022-08875-4

M3 - Article

C2 - 35678861

AN - SCOPUS:85131583232

SN - 0938-7994

VL - 32

SP - 8152

EP - 8161

JO - European radiology

JF - European radiology

IS - 12

ER -
