TY - JOUR
T1 - Variant-set association test for generalized linear mixed model
AU - Zhan, Xiang
AU - Banerjee, Kalins
AU - Chen, Jun
N1 - Funding Information:
The authors thank Dr. Andrea Schneider and Dr. Amanda Nelson for helpful discussion on an earlier draft of this manuscript. This study was, in part, supported by the National Institute of Allergy and Infectious Diseases of the National Institutes of Health (Grant No. R21AI144765), the Division of Mathematical Sciences of the National Science Foundation (Grant No. 1953189), and Mayo Clinic Center for Individualized Medicine. Computations for this study were performed on the Pennsylvania State University's Institute for Computational and Data Sciences' Roar supercomputer. The authors wish to thank the editor and two anonymous referees for their insightful comments and suggestions that have improved the paper.
Publisher Copyright:
© 2021 Wiley Periodicals LLC
PY - 2021/6
Y1 - 2021/6
N2 - Advances in high-throughput biotechnologies have culminated in a wide range of omics (such as genomics, epigenomics, transcriptomics, metabolomics, and metagenomics) studies, and increasing evidence in these studies indicates that the biological architecture of complex traits involves a large number of omics variants each with minor effects but collectively accounting for the full phenotypic variability. Thus, a major challenge in many “ome-wide” association analyses is to achieve adequate statistical power to identify multiple variants of small effect sizes, which is notoriously difficult for studies with relatively small-sample sizes. A small-sample adjustment incorporated in the kernel machine regression framework was proposed to solve this for association studies under various settings. However, such an adjustment in the generalized linear mixed model (GLMM) framework, which accounts for both sample relatedness and non-Gaussian outcomes, has not yet been attempted. In this study, we fill this gap by extending small-sample adjustment in kernel machine association test to GLMM. We propose a new Variant-Set Association Test (VSAT), a powerful and efficient analysis tool in GLMM, to examine the association between a set of omics variants and correlated phenotypes. The usefulness of VSAT is demonstrated using both numerical simulation studies and applications to data collected from multiple association studies. The software for implementing the proposed method in R is available at https://www.github.com/jchen1981/SSKAT.
AB - Advances in high-throughput biotechnologies have culminated in a wide range of omics (such as genomics, epigenomics, transcriptomics, metabolomics, and metagenomics) studies, and increasing evidence in these studies indicates that the biological architecture of complex traits involves a large number of omics variants each with minor effects but collectively accounting for the full phenotypic variability. Thus, a major challenge in many “ome-wide” association analyses is to achieve adequate statistical power to identify multiple variants of small effect sizes, which is notoriously difficult for studies with relatively small-sample sizes. A small-sample adjustment incorporated in the kernel machine regression framework was proposed to solve this for association studies under various settings. However, such an adjustment in the generalized linear mixed model (GLMM) framework, which accounts for both sample relatedness and non-Gaussian outcomes, has not yet been attempted. In this study, we fill this gap by extending small-sample adjustment in kernel machine association test to GLMM. We propose a new Variant-Set Association Test (VSAT), a powerful and efficient analysis tool in GLMM, to examine the association between a set of omics variants and correlated phenotypes. The usefulness of VSAT is demonstrated using both numerical simulation studies and applications to data collected from multiple association studies. The software for implementing the proposed method in R is available at https://www.github.com/jchen1981/SSKAT.
KW - generalized linear mixed model
KW - kernel machine regression
KW - omics variants
KW - small sample
KW - variant-set association test
UR - http://www.scopus.com/inward/record.url?scp=85100981918&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100981918&partnerID=8YFLogxK
U2 - 10.1002/gepi.22378
DO - 10.1002/gepi.22378
M3 - Article
C2 - 33604919
AN - SCOPUS:85100981918
SN - 0741-0395
VL - 45
SP - 402
EP - 412
JO - Genetic epidemiology
JF - Genetic epidemiology
IS - 4
ER -