Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data

Jianxin Shi(National Cancer Institute), Ju‐Hyun Park(Dongguk University), Jubao Duan(NorthShore University HealthSystem), Sonja T. Berndt(National Cancer Institute), Winton Moy(Northern Illinois University), Kai Yu(National Cancer Institute), Lei Song(National Cancer Institute), William Wheeler(Information Management Services), Xing Hua(National Cancer Institute), Debra T. Silverman(National Cancer Institute), Montserrat García‐Closas(National Cancer Institute), Chao A. Hsiung(National Health Research Institutes), Jonine D. Figueroa(National Cancer Institute), Victoria K. Cortessis(University of Southern California), Núria Malats(Spanish National Cancer Research Centre), Margaret R. Karagas(Dartmouth College), Paolo Vineis(Italian institute for Genomic Medicine), I‐Shou Chang(National Health Research Institutes), Dongxin Lin(Chinese Academy of Medical Sciences & Peking Union Medical College), Baosen Zhou(China Medical University), Adeline Seow(National University of Singapore), Keitaro Matsuo(Aichi Cancer Center), Yun‐Chul Hong(Seoul National University), Neil E. Caporaso(National Cancer Institute), Brian M. Wolpin(Brigham and Women's Hospital), Eric J. Jacobs(American Cancer Society), Gloria M. Petersen(Mayo Clinic in Florida), Alison P. Klein(Johns Hopkins University), Donghui Li(The University of Texas MD Anderson Cancer Center), Harvey A. Risch(Yale University), Alan R. Sanders(NorthShore University HealthSystem), Li Hsu(Fred Hutch Cancer Center), Robert E. Schoen(University of Pittsburgh Medical Center), Hermann Brenner(German Cancer Research Center), Rachael Z. Stolzenberg‐Solomon(National Cancer Institute), Pablo V. Gejman(NorthShore University HealthSystem), Qing Lan(National Cancer Institute), Nathaniel Rothman(National Cancer Institute), Laufey T. Ámundadóttir(National Cancer Institute), Maria Teresa Landi(National Cancer Institute), Douglas F. Levinson(Stanford University), Stephen J. Chanock(National Cancer Institute), Nilanjan Chatterjee(Johns Hopkins University)
PLoS Genetics
December 30, 2016
Cited by 147Open Access
Full Text

Abstract

Recent heritability analyses have indicated that genome-wide association studies (GWAS) have the potential to improve genetic risk prediction for complex diseases based on polygenic risk score (PRS), a simple modelling technique that can be implemented using summary-level data from the discovery samples. We herein propose modifications to improve the performance of PRS. We introduce threshold-dependent winner's-curse adjustments for marginal association coefficients that are used to weight the single-nucleotide polymorphisms (SNPs) in PRS. Further, as a way to incorporate external functional/annotation knowledge that could identify subsets of SNPs highly enriched for associations, we propose variable thresholds for SNPs selection. We applied our methods to GWAS summary-level data of 14 complex diseases. Across all diseases, a simple winner's curse correction uniformly led to enhancement of performance of the models, whereas incorporation of functional SNPs was beneficial only for selected diseases. Compared to the standard PRS algorithm, the proposed methods in combination led to notable gain in efficiency (25-50% increase in the prediction R2) for 5 of 14 diseases. As an example, for GWAS of type 2 diabetes, winner's curse correction improved prediction R2 from 2.29% based on the standard PRS to 3.10% (P = 0.0017) and incorporating functional annotation data further improved R2 to 3.53% (P = 2×10-5). Our simulation studies illustrate why differential treatment of certain categories of functional SNPs, even when shown to be highly enriched for GWAS-heritability, does not lead to proportionate improvement in genetic risk-prediction because of non-uniform linkage disequilibrium structure.


Related Papers

No related papers found

Powered by citation graph analysis