TY - JOUR
T1 - Peak learning of mass spectrometry imaging data using artificial neural networks
AU - Abdelmoula, Walid M.
AU - Lopez, Begona Gimenez Cassina
AU - Randall, Elizabeth C.
AU - Kapur, Tina
AU - Sarkaria, Jann N.
AU - White, Forest M.
AU - Agar, Jeffrey N.
AU - Wells, William M.
AU - Agar, Nathalie Y.R.
N1 - Funding Information:
We would like to acknowledge the contribution from Dr. Sylwia Stopka in revising the manuscript. This work was funded by NIH U54 CA210180 MIT/Mayo Physical Science Oncology Center for Drug Distribution and Drug Efficacy in Brain Tumors, and by the Dana-Farber Cancer Institute PLGA Fund. E.C.R. was in receipt of an NIH R25 (R25 CA-89017). NYRA receives support from the Advanced Technologies-National Center for Image Guided Therapy (AT-NCIGT) NIH P41EB028741 and NIH R01CA201469.
Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12/1
Y1 - 2021/12/1
N2 - Mass spectrometry imaging (MSI) is an emerging technology that holds potential for improving, biomarker discovery, metabolomics research, pharmaceutical applications and clinical diagnosis. Despite many solutions being developed, the large data size and high dimensional nature of MSI, especially 3D datasets, still pose computational and memory complexities that hinder accurate identification of biologically relevant molecular patterns. Moreover, the subjectivity in the selection of parameters for conventional pre-processing approaches can lead to bias. Therefore, we assess if a probabilistic generative model based on a fully connected variational autoencoder can be used for unsupervised analysis and peak learning of MSI data to uncover hidden structures. The resulting msiPL method learns and visualizes the underlying non-linear spectral manifold, revealing biologically relevant clusters of tissue anatomy in a mouse kidney and tumor heterogeneity in human prostatectomy tissue, colorectal carcinoma, and glioblastoma mouse model, with identification of underlying m/z peaks. The method is applied for the analysis of MSI datasets ranging from 3.3 to 78.9 GB, without prior pre-processing and peak picking, and acquired using different mass spectrometers at different centers.
AB - Mass spectrometry imaging (MSI) is an emerging technology that holds potential for improving, biomarker discovery, metabolomics research, pharmaceutical applications and clinical diagnosis. Despite many solutions being developed, the large data size and high dimensional nature of MSI, especially 3D datasets, still pose computational and memory complexities that hinder accurate identification of biologically relevant molecular patterns. Moreover, the subjectivity in the selection of parameters for conventional pre-processing approaches can lead to bias. Therefore, we assess if a probabilistic generative model based on a fully connected variational autoencoder can be used for unsupervised analysis and peak learning of MSI data to uncover hidden structures. The resulting msiPL method learns and visualizes the underlying non-linear spectral manifold, revealing biologically relevant clusters of tissue anatomy in a mouse kidney and tumor heterogeneity in human prostatectomy tissue, colorectal carcinoma, and glioblastoma mouse model, with identification of underlying m/z peaks. The method is applied for the analysis of MSI datasets ranging from 3.3 to 78.9 GB, without prior pre-processing and peak picking, and acquired using different mass spectrometers at different centers.
UR - http://www.scopus.com/inward/record.url?scp=85115427624&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85115427624&partnerID=8YFLogxK
U2 - 10.1038/s41467-021-25744-8
DO - 10.1038/s41467-021-25744-8
M3 - Article
C2 - 34545087
AN - SCOPUS:85115427624
SN - 2041-1723
VL - 12
JO - Nature communications
JF - Nature communications
IS - 1
M1 - 5544
ER -