Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors

Wenhan Chen(Garvan Institute of Medical Research), Yang Wu(The University of Queensland), Zhili Zheng(The University of Queensland), Ting Qi(The University of Queensland), Peter M. Visscher(The University of Queensland), Zhihong Zhu(The University of Queensland), Jian Yang(The University of Queensland)
Nature Communications
December 8, 2021
Cited by 70Open Access
Full Text

Abstract

statistics from genome-wide association studies (GWAS) have facilitated the development of various summary data-based methods, which typically require a reference sample for linkage disequilibrium (LD) estimation. Analyses using these methods may be biased by errors in GWAS summary data or LD reference or heterogeneity between GWAS and LD reference. Here we propose a quality control method, DENTIST, that leverages LD among genetic variants to detect and eliminate errors in GWAS or LD reference and heterogeneity between the two. Through simulations, we demonstrate that DENTIST substantially reduces false-positive rate in detecting secondary signals in the summary-data-based conditional and joint association analysis, especially for imputed rare variants (false-positive rate reduced from >28% to <2% in the presence of heterogeneity between GWAS and LD reference). We further show that DENTIST can improve other summary-data-based analyses such as fine-mapping analysis.


Related Papers

No related papers found

Powered by citation graph analysis