TY - JOUR
T1 - Principled multi-omic analysis reveals gene regulatory mechanisms of phenotype variation
AU - Hanson, Casey
AU - Cairns, Junmei
AU - Wang, Liewei
AU - Sinha, Saurabh
N1 - Funding Information:
This work was provided in part by the National Institutes of Health (R01 GM114341 to S.S., grant U54 GM114838 to S.S., U19 GM61388 Pharmacogenomics Research Network and R01 CA138461 to L.W.), in part by the Mayo Clinic-UIUC Alliance, and by grant 1U54GM114838 awarded by the National Institute of General Medical Sciences through funds provided by the trans-NIH Big Data to Knowledge (BD2K) initiative (www.bd2k.nih.gov). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Funding Information:
Funding for this work was provided in part by the National Institutes of Health (R01 GM114341 to S.S., grant U54 GM114838 to S.S., U19 GM61388 Pharmacogenomics Research Network and R01 CA138461 to L.W.), in part by the Mayo Clinic-UIUC Alliance, and by grant 1U54GM114838 awarded by the National Institute of General Medical Sciences through funds provided by the trans-NIH Big Data to Knowledge (BD2K) initiative (www.bd2k.nih.gov). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Publisher Copyright:
© 2018 Hanson et al.
PY - 2018/8
Y1 - 2018/8
N2 - Recent studies have analyzed large-scale data sets of gene expression to identify genes associated with interindividual variation in phenotypes ranging from cancer subtypes to drug sensitivity, promising new avenues of research in personalized medicine. However, gene expression data alone is limited in its ability to reveal cis-regulatory mechanisms underlying phenotypic differences. In this study, we develop a new probabilistic model, called pGENMi, that integrates multi-omic data to investigate the transcriptional regulatory mechanisms underlying interindividual variation of a specific phenotype'that of cell line response to cytotoxic treatment. In particular, pGENMi simultaneously analyzes genotype, DNA methylation, gene expression, and transcription factor (TF)-DNA binding data, along with phenotypic measurements, to identify TFs regulating the phenotype. It does so by combining statistical information about expression quantitative trait loci (eQTLs) and expression-correlated methylation marks (eQTMs) located within TF binding sites, as well as observed correlations between gene expression and phenotype variation. Application of pGENMi to data from a panel of lymphoblastoid cell lines treated with 24 drugs, in conjunction with ENCODE TF ChIP data, yielded a number of known as well as novel (TF, Drug) associations. Experimental validations by TF knockdown confirmed 41% of the predicted and tested associations, compared to a 12% confirmation rate of tested nonassociations (controls). An extensive literature survey also corroborated 62% of the predicted associations above a stringent threshold. Moreover, associations predicted only when combining eQTL and eQTM data showed higher precision compared to an eQTL-only or eQTM-only analysis using pGENMi, further demonstrating the value of multi-omic integrative analysis.
AB - Recent studies have analyzed large-scale data sets of gene expression to identify genes associated with interindividual variation in phenotypes ranging from cancer subtypes to drug sensitivity, promising new avenues of research in personalized medicine. However, gene expression data alone is limited in its ability to reveal cis-regulatory mechanisms underlying phenotypic differences. In this study, we develop a new probabilistic model, called pGENMi, that integrates multi-omic data to investigate the transcriptional regulatory mechanisms underlying interindividual variation of a specific phenotype'that of cell line response to cytotoxic treatment. In particular, pGENMi simultaneously analyzes genotype, DNA methylation, gene expression, and transcription factor (TF)-DNA binding data, along with phenotypic measurements, to identify TFs regulating the phenotype. It does so by combining statistical information about expression quantitative trait loci (eQTLs) and expression-correlated methylation marks (eQTMs) located within TF binding sites, as well as observed correlations between gene expression and phenotype variation. Application of pGENMi to data from a panel of lymphoblastoid cell lines treated with 24 drugs, in conjunction with ENCODE TF ChIP data, yielded a number of known as well as novel (TF, Drug) associations. Experimental validations by TF knockdown confirmed 41% of the predicted and tested associations, compared to a 12% confirmation rate of tested nonassociations (controls). An extensive literature survey also corroborated 62% of the predicted associations above a stringent threshold. Moreover, associations predicted only when combining eQTL and eQTM data showed higher precision compared to an eQTL-only or eQTM-only analysis using pGENMi, further demonstrating the value of multi-omic integrative analysis.
UR - http://www.scopus.com/inward/record.url?scp=85050889241&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85050889241&partnerID=8YFLogxK
U2 - 10.1101/gr.227066.117
DO - 10.1101/gr.227066.117
M3 - Article
C2 - 29898900
AN - SCOPUS:85050889241
SN - 1088-9051
VL - 28
SP - 1207
EP - 1216
JO - Genome Research
JF - Genome Research
IS - 8
ER -