TY - JOUR
T1 - Development of a multiomics model for identification of predictive biomarkers for COVID-19 severity
T2 - a retrospective cohort study
AU - Byeon, Seul Kee
AU - Madugundu, Anil K.
AU - Garapati, Kishore
AU - Ramarajan, Madan Gopal
AU - Saraswat, Mayank
AU - Kumar-M, Praveen
AU - Hughes, Travis
AU - Shah, Rameen
AU - Patnaik, Mrinal M.
AU - Chia, Nicholas
AU - Ashrafzadeh-Kian, Susan
AU - Yao, Joseph D.
AU - Pritt, Bobbi S.
AU - Cattaneo, Roberto
AU - Salama, Mohamed E.
AU - Zenka, Roman M.
AU - Kipp, Benjamin R.
AU - Grebe, Stefan K.G.
AU - Singh, Ravinder J.
AU - Sadighi Akha, Amir A.
AU - Algeciras-Schimnich, Alicia
AU - Dasari, Surendra
AU - Olson, Janet E.
AU - Walsh, Jesse R.
AU - Venkatakrishnan, A. J.
AU - Jenkinson, Garrett
AU - O'Horo, John C.
AU - Badley, Andrew D.
AU - Pandey, Akhilesh
N1 - Funding Information:
PK-M, TH, and AJV are employees of nference. JCO receives grants from nference and personal fees from Elsevier and Bates College, outside the submitted work. ADB is supported by grants from NIAID (AI110173 and AI120698), Amfar (109593), and Mayo Clinic (HH Shieck Khalifa Bib Zayed Al-Nahyan Named Professorship of Infectious Diseases). ADB is a paid consultant for AbbVie, Gilead, Freedom Tunnel, Pinetree Therapeutics, Primmune, Immunome, MarPam, Rion, and Flambeau Diagnostics; is a paid member of the DSMB for Corvus Pharmaceuticals, Equilium, and Excision Biotherapeutics; has received fees for speaking for Reach MD, Peer Voice, and Medscape; owns equity for scientific advisory work in Zentalis Rion and nference; and is founder and President of Splissen Therapeutics.
Funding Information:
Funding for the molecular profiling was supported by the generosity of Eric and Wendy Schmidt. We are grateful to Jessica L Lesko for coordinating the project. We thank Jon Harrington, Josh Bublitz, and Terra Lasho for organising and sharing patient and clinical data. We acknowledge Katelyn A Reed for assistance with institutional review boards and logistical support. We would also like to thank Dong-Gi Mun for help with samples.
Publisher Copyright:
© 2022 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license
PY - 2022/9
Y1 - 2022/9
N2 - Background: COVID-19 is a multi-system disorder with high variability in clinical outcomes among patients who are admitted to hospital. Although some cytokines such as interleukin (IL)-6 are believed to be associated with severity, there are no early biomarkers that can reliably predict patients who are more likely to have adverse outcomes. Thus, it is crucial to discover predictive markers of serious complications. Methods: In this retrospective cohort study, we analysed samples from 455 participants with COVID-19 who had had a positive SARS-CoV-2 RT-PCR result between April 14, 2020, and Dec 1, 2020 and who had visited one of three Mayo Clinic sites in the USA (Minnesota, Arizona, or Florida) in the same period. These participants were assigned to three subgroups depending on disease severity as defined by the WHO ordinal scale of clinical improvement (outpatient, severe, or critical). Our control cohort comprised of 182 anonymised age-matched and sex-matched plasma samples that were available from the Mayo Clinic Biorepository and banked before the COVID-19 pandemic. We did a deep profiling of circulatory cytokines and other proteins, lipids, and metabolites from both cohorts. Most patient samples were collected before, or around the time of, hospital admission, representing ideal samples for predictive biomarker discovery. We used proximity extension assays to quantify cytokines and circulatory proteins and tandem mass spectrometry to measure lipids and metabolites. Biomarker discovery was done by applying an AutoGluon-tabular classifier to a multiomics dataset, producing a stacked ensemble of cutting-edge machine learning algorithms. Global proteomics and glycoproteomics on a subset of patient samples with matched pre-COVID-19 plasma samples was also done. Findings: We quantified 1463 cytokines and circulatory proteins, along with 902 lipids and 1018 metabolites. By developing a machine-learning-based prediction model, a set of 102 biomarkers, which predicted severe and clinical COVID-19 outcomes better than the traditional set of cytokines, were discovered. These predictive biomarkers included several novel cytokines and other proteins, lipids, and metabolites. For example, altered amounts of C-type lectin domain family 6 member A (CLEC6A), ether phosphatidylethanolamine (P-18:1/18:1), and 2-hydroxydecanoate, as reported here, have not previously been associated with severity in COVID-19. Patient samples with matched pre-COVID-19 plasma samples showed similar trends in muti-omics signatures along with differences in glycoproteomics profile. Interpretation: A multiomic molecular signature in the plasma of patients with COVID-19 before being admitted to hospital can be exploited to predict a more severe course of disease. Machine learning approaches can be applied to highly complex and multidimensional profiling data to reveal novel signatures of clinical use. The absence of validation in an independent cohort remains a major limitation of the study. Funding: Eric and Wendy Schmidt.
AB - Background: COVID-19 is a multi-system disorder with high variability in clinical outcomes among patients who are admitted to hospital. Although some cytokines such as interleukin (IL)-6 are believed to be associated with severity, there are no early biomarkers that can reliably predict patients who are more likely to have adverse outcomes. Thus, it is crucial to discover predictive markers of serious complications. Methods: In this retrospective cohort study, we analysed samples from 455 participants with COVID-19 who had had a positive SARS-CoV-2 RT-PCR result between April 14, 2020, and Dec 1, 2020 and who had visited one of three Mayo Clinic sites in the USA (Minnesota, Arizona, or Florida) in the same period. These participants were assigned to three subgroups depending on disease severity as defined by the WHO ordinal scale of clinical improvement (outpatient, severe, or critical). Our control cohort comprised of 182 anonymised age-matched and sex-matched plasma samples that were available from the Mayo Clinic Biorepository and banked before the COVID-19 pandemic. We did a deep profiling of circulatory cytokines and other proteins, lipids, and metabolites from both cohorts. Most patient samples were collected before, or around the time of, hospital admission, representing ideal samples for predictive biomarker discovery. We used proximity extension assays to quantify cytokines and circulatory proteins and tandem mass spectrometry to measure lipids and metabolites. Biomarker discovery was done by applying an AutoGluon-tabular classifier to a multiomics dataset, producing a stacked ensemble of cutting-edge machine learning algorithms. Global proteomics and glycoproteomics on a subset of patient samples with matched pre-COVID-19 plasma samples was also done. Findings: We quantified 1463 cytokines and circulatory proteins, along with 902 lipids and 1018 metabolites. By developing a machine-learning-based prediction model, a set of 102 biomarkers, which predicted severe and clinical COVID-19 outcomes better than the traditional set of cytokines, were discovered. These predictive biomarkers included several novel cytokines and other proteins, lipids, and metabolites. For example, altered amounts of C-type lectin domain family 6 member A (CLEC6A), ether phosphatidylethanolamine (P-18:1/18:1), and 2-hydroxydecanoate, as reported here, have not previously been associated with severity in COVID-19. Patient samples with matched pre-COVID-19 plasma samples showed similar trends in muti-omics signatures along with differences in glycoproteomics profile. Interpretation: A multiomic molecular signature in the plasma of patients with COVID-19 before being admitted to hospital can be exploited to predict a more severe course of disease. Machine learning approaches can be applied to highly complex and multidimensional profiling data to reveal novel signatures of clinical use. The absence of validation in an independent cohort remains a major limitation of the study. Funding: Eric and Wendy Schmidt.
UR - http://www.scopus.com/inward/record.url?scp=85136249004&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85136249004&partnerID=8YFLogxK
U2 - 10.1016/S2589-7500(22)00112-1
DO - 10.1016/S2589-7500(22)00112-1
M3 - Article
C2 - 35835712
AN - SCOPUS:85136249004
SN - 2589-7500
VL - 4
SP - e632-e645
JO - The Lancet Digital Health
JF - The Lancet Digital Health
IS - 9
ER -