General Kernel Machine Methods for Multi‐Omics Integration and Genome‐Wide Association Testing With Related Individuals

Amarise Little(University of Washington), Ni Zhao(Johns Hopkins University), Anna V. Mikhaylova(University of Washington), Angela Zhang(University of Washington), Wodan Ling(Cornell University), Florian Thibord(National Heart Lung and Blood Institute), Andrew D. Johnson(National Heart Lung and Blood Institute), Laura M. Raffield(University of North Carolina at Chapel Hill), Joanne E. Curran(The University of Texas Rio Grande Valley), John Blangero(The University of Texas Rio Grande Valley), Jeffrey R. O’Connell(University of Maryland, Baltimore), Huichun Xu(University of Maryland, Baltimore), Jerome I. Rotter(UCLA Medical Center), Stephen S. Rich(University of Virginia), Kenneth Rice(University of Washington), Ming‐Huei Chen(National Heart Lung and Blood Institute), Alexander P. Reiner(Fred Hutch Cancer Center), Charles Kooperberg(Fred Hutch Cancer Center), Thao Vu(University of Colorado Anschutz Medical Campus), Lifang Hou(Northwestern University), Myriam Fornage(Brown Foundation), Ruth J. F. Loos(University of Copenhagen), Eimear E. Kenny(Genomic Health (United States)), Rasika A. Mathias(Johns Hopkins University), Lewis C. Becker(Johns Hopkins University), Albert V. Smith(University of Michigan), Eric Boerwinkle(Baylor College of Medicine), Bing Yu(The University of Texas Health Science Center at Houston), Timothy Thornton(University of Washington), Michael C. Wu(University of Washington)
Genetic Epidemiology
January 15, 2025
Cited by 1Open Access
Full Text

Abstract

Integrating multi-omics data may help researchers understand the genetic underpinnings of complex traits and diseases. However, the best ways to integrate multi-omics data and use them to address pressing scientific questions remain a challenge. One important and topical problem is how to assess the aggregate effect of multiple genomic data types (e.g. genotypes and gene expression levels) on a phenotype, particularly while accommodating routine issues, such as having related subjects' data in analyses. In this paper, we extend an existing composite kernel machine regression model to integrate two multi-omics data types, while accommodating for general correlation structures amongst outcomes. Due to the kernel machine regression framework, our methods allow for the integration of high-dimensional omics data with small, nonlinear, and interactive effects, and accommodation of general study designs. Here, we focus on scientific questions that aim to assess the association between a functional grouping (such as a gene or a pathway) and a quantitative trait of interest. We use a kernel machine regression to integrate the two multi-omics data types, as they may relate to the trait, and perform a global test of association. We demonstrate the advantage of this approach over single data type association tests via simulation. Finally, we apply this method to a large, multi-ethnic data set to investigate how predicted gene expression and rare genetic variation may be related to two platelet traits.


Related Papers

No related papers found

Powered by citation graph analysis