Ensemble Deep Learning Derived from Transfer Learning for Classification of COVID-19 Patients on Hybrid Deep-Learning-Based Lung Segmentation: A Data Augmentation and Balancing Framework

Arun Kumar Dubey, Gian Luca Chabert, Alessandro Carriero, Alessio Pasche, Pietro S.C. Danna, Sushant Agarwal, Lopamudra Mohanty, Nillmani, Neeraj Sharma, Sarita Yadav, Achin Jain, Ashish Kumar, Mannudeep K. Kalra, David W. Sobel, John R. Laird, Inder M. Singh, Narpinder Singh, George Tsoulfas, Mostafa M. Fouda, Azra AlizadGeorge D. Kitas, Narendra N. Khanna, Klaudija Viskovic, Melita Kukuljan, Mustafa Al-Maini, Ayman El-Baz, Luca Saba, Jasjit S. Suri

Research output: Contribution to journalArticlepeer-review

Abstract

Background and motivation: Lung computed tomography (CT) techniques are high-resolution and are well adopted in the intensive care unit (ICU) for COVID-19 disease control classification. Most artificial intelligence (AI) systems do not undergo generalization and are typically overfitted. Such trained AI systems are not practical for clinical settings and therefore do not give accurate results when executed on unseen data sets. We hypothesize that ensemble deep learning (EDL) is superior to deep transfer learning (TL) in both non-augmented and augmented frameworks. Methodology: The system consists of a cascade of quality control, ResNet–UNet-based hybrid deep learning for lung segmentation, and seven models using TL-based classification followed by five types of EDL’s. To prove our hypothesis, five different kinds of data combinations (DC) were designed using a combination of two multicenter cohorts—Croatia (80 COVID) and Italy (72 COVID and 30 controls)—leading to 12,000 CT slices. As part of generalization, the system was tested on unseen data and statistically tested for reliability/stability. Results: Using the K5 (80:20) cross-validation protocol on the balanced and augmented dataset, the five DC datasets improved TL mean accuracy by 3.32%, 6.56%, 12.96%, 47.1%, and 2.78%, respectively. The five EDL systems showed improvements in accuracy of 2.12%, 5.78%, 6.72%, 32.05%, and 2.40%, thus validating our hypothesis. All statistical tests proved positive for reliability and stability. Conclusion: EDL showed superior performance to TL systems for both (a) unbalanced and unaugmented and (b) balanced and augmented datasets for both (i) seen and (ii) unseen paradigms, validating both our hypotheses.

Original languageEnglish (US)
Article number1954
JournalDiagnostics
Volume13
Issue number11
DOIs
StatePublished - Jun 2023

Keywords

  • COVID
  • ResNet–UNet
  • control
  • ensemble deep learning
  • transfer learning
  • unseen

ASJC Scopus subject areas

  • Clinical Biochemistry

Fingerprint

Dive into the research topics of 'Ensemble Deep Learning Derived from Transfer Learning for Classification of COVID-19 Patients on Hybrid Deep-Learning-Based Lung Segmentation: A Data Augmentation and Balancing Framework'. Together they form a unique fingerprint.

Cite this