Ensemble Deep Learning Derived from Transfer Learning for Classification of COVID-19 Patients on Hybrid Deep-Learning-Based Lung Segmentation: A Data Augmentation and Balancing Framework

Arun Kumar Dubey; Gian Luca Chabert; Alessandro Carriero; Alessio Pasche; Pietro S.C. Danna; Sushant Agarwal; Lopamudra Mohanty; Nillmani; Neeraj Sharma; Sarita Yadav; Achin Jain; Ashish Kumar; Mannudeep K. Kalra; David W. Sobel; John R. Laird; Inder M. Singh; Narpinder Singh; George Tsoulfas; Mostafa M. Fouda; Azra Alizad; George D. Kitas; Narendra N. Khanna; Klaudija Viskovic; Melita Kukuljan; Mustafa Al-Maini; Ayman El-Baz; Luca Saba; Jasjit S. Suri

doi:10.3390/diagnostics13111954

Physiology & Biomedical Engineering

Research output: Contribution to journal › Article › peer-review

Abstract

Background and motivation: Lung computed tomography (CT) techniques are high-resolution and are well adopted in the intensive care unit (ICU) for COVID-19 disease control classification. Most artificial intelligence (AI) systems are not evaluated for generalization and are typically overfitted. Such systems do not give accurate results when executed on unseen data sets and are therefore not practical for clinical settings. We hypothesize that ensemble deep learning (EDL) is superior to deep transfer learning (TL) in both non-augmented and augmented frameworks.

Methodology: The system consists of a cascade of quality control, ResNet–UNet-based hybrid deep learning for lung segmentation, and seven TL-based classification models followed by five types of EDL. To test our hypothesis, five different kinds of data combinations (DC) were designed using a combination of two multicenter cohorts, Croatia (80 COVID) and Italy (72 COVID and 30 controls), leading to 12,000 CT slices. As part of generalization, the system was tested on unseen data and statistically tested for reliability and stability.

Results: Using the K5 (80:20) cross-validation protocol on the balanced and augmented dataset, the five DC datasets improved TL mean accuracy by 3.32%, 6.56%, 12.96%, 47.1%, and 2.78%, respectively. The five EDL systems showed improvements in accuracy of 2.12%, 5.78%, 6.72%, 32.05%, and 2.40%, thus validating our hypothesis. All statistical tests supported the reliability and stability of the system.

Conclusion: EDL outperformed TL systems on both (a) unbalanced and unaugmented and (b) balanced and augmented datasets, under both (i) seen and (ii) unseen paradigms, validating our hypothesis.
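The two evaluation ideas named in the abstract, a K5 (80:20) cross-validation split and an ensemble built on top of several TL classifiers, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the five EDL types combine their member models, so the soft-voting rule (averaging per-model probabilities), the 0.5 threshold, and the function names `five_fold_splits` and `soft_vote` are all assumptions made for illustration.

```python
from statistics import mean


def five_fold_splits(n_items, k=5):
    """Yield (train_idx, test_idx) lists; with k=5 each fold is an 80:20 split."""
    indices = list(range(n_items))
    fold_size = n_items // k
    for fold in range(k):
        start = fold * fold_size
        # The last fold absorbs any remainder so every item is tested exactly once.
        stop = n_items if fold == k - 1 else start + fold_size
        test_idx = indices[start:stop]
        train_idx = indices[:start] + indices[stop:]
        yield train_idx, test_idx


def soft_vote(prob_per_model, threshold=0.5):
    """Assumed ensemble rule: average the per-model COVID probabilities
    for one CT slice and threshold the mean (hypothetical, not from the paper)."""
    avg = mean(prob_per_model)
    label = "COVID" if avg >= threshold else "control"
    return label, avg


# Hypothetical usage: 100 slices, three member models scoring one slice.
splits = list(five_fold_splits(100))
label, avg = soft_vote([0.9, 0.4, 0.7])
```

With 100 items, each of the five folds trains on 80 and tests on 20, which is what the 80:20 ratio in "K5 (80:20)" refers to. The soft vote shows how an ensemble can overrule a single dissenting member: one model scores the slice below 0.5, but the averaged probability still exceeds the threshold.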

Original language	English (US)
Article number	1954
Journal	Diagnostics
Volume	13
Issue number	11
DOIs	https://doi.org/10.3390/diagnostics13111954
State	Published - Jun 2023

Keywords

COVID
ResNet–UNet
control
ensemble deep learning
transfer learning
unseen

ASJC Scopus subject areas

Clinical Biochemistry

Cite this

Dubey, A. K., Chabert, G. L., Carriero, A., Pasche, A., Danna, P. S. C., Agarwal, S., Mohanty, L., Nillmani, Sharma, N., Yadav, S., Jain, A., Kumar, A., Kalra, M. K., Sobel, D. W., Laird, J. R., Singh, I. M., Singh, N., Tsoulfas, G., Fouda, M. M., ... Suri, J. S. (2023). Ensemble Deep Learning Derived from Transfer Learning for Classification of COVID-19 Patients on Hybrid Deep-Learning-Based Lung Segmentation: A Data Augmentation and Balancing Framework. Diagnostics, 13(11), Article 1954. https://doi.org/10.3390/diagnostics13111954

@article{e0cc5af6258942d29b487c277dbb6ff0,

title = "Ensemble Deep Learning Derived from Transfer Learning for Classification of COVID-19 Patients on Hybrid Deep-Learning-Based Lung Segmentation: A Data Augmentation and Balancing Framework",

keywords = "COVID, ResNet–UNet, control, ensemble deep learning, transfer learning, unseen",

author = "Dubey, {Arun Kumar} and Chabert, {Gian Luca} and Alessandro Carriero and Alessio Pasche and Danna, {Pietro S.C.} and Sushant Agarwal and Lopamudra Mohanty and Nillmani and Neeraj Sharma and Sarita Yadav and Achin Jain and Ashish Kumar and Kalra, {Mannudeep K.} and Sobel, {David W.} and Laird, {John R.} and Singh, {Inder M.} and Narpinder Singh and George Tsoulfas and Fouda, {Mostafa M.} and Azra Alizad and Kitas, {George D.} and Khanna, {Narendra N.} and Klaudija Viskovic and Melita Kukuljan and Mustafa Al-Maini and Ayman El-Baz and Luca Saba and Suri, {Jasjit S.}",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = jun,

doi = "10.3390/diagnostics13111954",

language = "English (US)",

volume = "13",

journal = "Diagnostics",

issn = "2075-4418",

publisher = "MDPI AG",

number = "11",

}

TY - JOUR

T1 - Ensemble Deep Learning Derived from Transfer Learning for Classification of COVID-19 Patients on Hybrid Deep-Learning-Based Lung Segmentation

T2 - A Data Augmentation and Balancing Framework

AU - Dubey, Arun Kumar

AU - Chabert, Gian Luca

AU - Carriero, Alessandro

AU - Pasche, Alessio

AU - Danna, Pietro S.C.

AU - Agarwal, Sushant

AU - Mohanty, Lopamudra

AU - Nillmani,

AU - Sharma, Neeraj

AU - Yadav, Sarita

AU - Jain, Achin

AU - Kumar, Ashish

AU - Kalra, Mannudeep K.

AU - Sobel, David W.

AU - Laird, John R.

AU - Singh, Inder M.

AU - Singh, Narpinder

AU - Tsoulfas, George

AU - Fouda, Mostafa M.

AU - Alizad, Azra

AU - Kitas, George D.

AU - Khanna, Narendra N.

AU - Viskovic, Klaudija

AU - Kukuljan, Melita

AU - Al-Maini, Mustafa

AU - El-Baz, Ayman

AU - Saba, Luca

AU - Suri, Jasjit S.

PY - 2023/6

Y1 - 2023/6

KW - COVID

KW - ResNet–UNet

KW - control

KW - ensemble deep learning

KW - transfer learning

KW - unseen

UR - http://www.scopus.com/inward/record.url?scp=85161726819&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85161726819&partnerID=8YFLogxK

U2 - 10.3390/diagnostics13111954

DO - 10.3390/diagnostics13111954

M3 - Article

AN - SCOPUS:85161726819

SN - 2075-4418

VL - 13

JO - Diagnostics

JF - Diagnostics

IS - 11

M1 - 1954

ER -
