TY - JOUR
T1 - Radiomics-based Machine-learning Models Can Detect Pancreatic Cancer on Prediagnostic Computed Tomography Scans at a Substantial Lead Time Before Clinical Diagnosis
AU - Mukherjee, Sovanlal
AU - Patra, Anurima
AU - Khasawneh, Hala
AU - Korfiatis, Panagiotis
AU - Rajamohan, Naveen
AU - Suman, Garima
AU - Majumder, Shounak
AU - Panda, Ananya
AU - Johnson, Matthew P.
AU - Larson, Nicholas B.
AU - Wright, Darryl E.
AU - Kline, Timothy
AU - Fletcher, Joel Garland
AU - Chari, Suresh T
AU - Goenka, Ajit H.
N1 - Funding Information:
Funding Ajit H. Goenka gratefully acknowledges a research grant from the Champions for Hope Pancreatic Cancer Research Program of the Funk Zitiello Foundation , Advance the Practice Award from the Department of Radiology, Mayo Clinic , Rochester, Minnesota, and the Centene Charitable Foundation . Unrelated to this work: Ajit H. Goenka is the principal investigator ( PI ) and supported by CA190188, Department of Defense , Office of the Congressionally Directed Medical Research Programs . Ajit H. Goenka is also the co- PI and supported by R01CA256969, National Cancer Institute of the National Institutes of Health. Ajit H. Goenka is also on the Advisory Board (ad hoc), BlueStar Genomics.
Publisher Copyright:
© 2022 AGA Institute
PY - 2022/11
Y1 - 2022/11
N2 - Background & Aims: Our purpose was to detect pancreatic ductal adenocarcinoma (PDAC) at the prediagnostic stage (3–36 months before clinical diagnosis) using radiomics-based machine-learning (ML) models, and to compare performance against radiologists in a case-control study. Methods: Volumetric pancreas segmentation was performed on prediagnostic computed tomography scans (CTs) (median interval between CT and PDAC diagnosis: 398 days) of 155 patients and an age-matched cohort of 265 subjects with normal pancreas. A total of 88 first-order and gray-level radiomic features were extracted and 34 features were selected through the least absolute shrinkage and selection operator–based feature selection method. The dataset was randomly divided into training (292 CTs: 110 prediagnostic and 182 controls) and test subsets (128 CTs: 45 prediagnostic and 83 controls). Four ML classifiers, k-nearest neighbor (KNN), support vector machine (SVM), random forest (RM), and extreme gradient boosting (XGBoost), were evaluated. Specificity of model with highest accuracy was further validated on an independent internal dataset (n = 176) and the public National Institutes of Health dataset (n = 80). Two radiologists (R4 and R5) independently evaluated the pancreas on a 5-point diagnostic scale. Results: Median (range) time between prediagnostic CTs of the test subset and PDAC diagnosis was 386 (97–1092) days. SVM had the highest sensitivity (mean; 95% confidence interval) (95.5; 85.5–100.0), specificity (90.3; 84.3–91.5), F1-score (89.5; 82.3–91.7), area under the curve (AUC) (0.98; 0.94–0.98), and accuracy (92.2%; 86.7–93.7) for classification of CTs into prediagnostic versus normal. All 3 other ML models, KNN, RF, and XGBoost, had comparable AUCs (0.95, 0.95, and 0.96, respectively). The high specificity of SVM was generalizable to both the independent internal (92.6%) and the National Institutes of Health dataset (96.2%). In contrast, interreader radiologist agreement was only fair (Cohen's kappa 0.3) and their mean AUC (0.66; 0.46–0.86) was lower than each of the 4 ML models (AUCs: 0.95–0.98) (P < .001). Radiologists also recorded false positive indirect findings of PDAC in control subjects (n = 83) (7% R4, 18% R5). Conclusions: Radiomics-based ML models can detect PDAC from normal pancreas when it is beyond human interrogation capability at a substantial lead time before clinical diagnosis. Prospective validation and integration of such models with complementary fluid-based biomarkers has the potential for PDAC detection at a stage when surgical cure is a possibility.
AB - Background & Aims: Our purpose was to detect pancreatic ductal adenocarcinoma (PDAC) at the prediagnostic stage (3–36 months before clinical diagnosis) using radiomics-based machine-learning (ML) models, and to compare performance against radiologists in a case-control study. Methods: Volumetric pancreas segmentation was performed on prediagnostic computed tomography scans (CTs) (median interval between CT and PDAC diagnosis: 398 days) of 155 patients and an age-matched cohort of 265 subjects with normal pancreas. A total of 88 first-order and gray-level radiomic features were extracted and 34 features were selected through the least absolute shrinkage and selection operator–based feature selection method. The dataset was randomly divided into training (292 CTs: 110 prediagnostic and 182 controls) and test subsets (128 CTs: 45 prediagnostic and 83 controls). Four ML classifiers, k-nearest neighbor (KNN), support vector machine (SVM), random forest (RM), and extreme gradient boosting (XGBoost), were evaluated. Specificity of model with highest accuracy was further validated on an independent internal dataset (n = 176) and the public National Institutes of Health dataset (n = 80). Two radiologists (R4 and R5) independently evaluated the pancreas on a 5-point diagnostic scale. Results: Median (range) time between prediagnostic CTs of the test subset and PDAC diagnosis was 386 (97–1092) days. SVM had the highest sensitivity (mean; 95% confidence interval) (95.5; 85.5–100.0), specificity (90.3; 84.3–91.5), F1-score (89.5; 82.3–91.7), area under the curve (AUC) (0.98; 0.94–0.98), and accuracy (92.2%; 86.7–93.7) for classification of CTs into prediagnostic versus normal. All 3 other ML models, KNN, RF, and XGBoost, had comparable AUCs (0.95, 0.95, and 0.96, respectively). The high specificity of SVM was generalizable to both the independent internal (92.6%) and the National Institutes of Health dataset (96.2%). In contrast, interreader radiologist agreement was only fair (Cohen's kappa 0.3) and their mean AUC (0.66; 0.46–0.86) was lower than each of the 4 ML models (AUCs: 0.95–0.98) (P < .001). Radiologists also recorded false positive indirect findings of PDAC in control subjects (n = 83) (7% R4, 18% R5). Conclusions: Radiomics-based ML models can detect PDAC from normal pancreas when it is beyond human interrogation capability at a substantial lead time before clinical diagnosis. Prospective validation and integration of such models with complementary fluid-based biomarkers has the potential for PDAC detection at a stage when surgical cure is a possibility.
KW - Artificial Intelligence
KW - Biomarkers
KW - Pancreas
KW - Pancreatic Ductal Carcinoma
KW - X-Ray Computed Tomography
UR - http://www.scopus.com/inward/record.url?scp=85137820009&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137820009&partnerID=8YFLogxK
U2 - 10.1053/j.gastro.2022.06.066
DO - 10.1053/j.gastro.2022.06.066
M3 - Article
C2 - 35788343
AN - SCOPUS:85137820009
SN - 0016-5085
VL - 163
SP - 1435-1446.e3
JO - Gastroenterology
JF - Gastroenterology
IS - 5
ER -