Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort

Yambazi Banda(University of California, San Francisco), Mark Kvale(University of California, San Francisco), Thomas J. Hoffmann(University of California, San Francisco), Stephanie Hesselson(University of California, San Francisco), Dilrini K. Ranatunga(Kaiser Permanente), Hua Tang(Stanford University), Chiara Sabatti(Stanford University), Lisa Croen(Kaiser Permanente), Brad Dispensa(University of California, San Francisco), Mary Henderson(Kaiser Permanente), Carlos Iribarren(Kaiser Permanente), Eric Jorgenson(Kaiser Permanente), Lawrence H. Kushi(Kaiser Permanente), Dana Ludwig(Kaiser Permanente), Diane Olberg(Kaiser Permanente), Charles P. Quesenberry(Kaiser Permanente), Sarah Rowell(Kaiser Permanente), Marianne Sadler(Kaiser Permanente), Lori C. Sakoda(Kaiser Permanente), Stanley Sciortino(Kaiser Permanente), Ling Shen(Kaiser Permanente), David Smethurst(Kaiser Permanente), Carol P. Somkin(Kaiser Permanente), Stephen K. Van Den Eeden(Kaiser Permanente), Lawrence Walter(Kaiser Permanente), Rachel A. Whitmer(Kaiser Permanente), Pui–Yan Kwok(University of California, San Francisco), Catherine Schaefer(Kaiser Permanente), Neil Risch(Kaiser Permanente)
Genetics
June 19, 2015
Cited by 355Open Access
Full Text

Abstract

Using genome-wide genotypes, we characterized the genetic structure of 103,006 participants in the Kaiser Permanente Northern California multi-ethnic Genetic Epidemiology Research on Adult Health and Aging Cohort and analyzed the relationship to self-reported race/ethnicity. Participants endorsed any of 23 race/ethnicity/nationality categories, which were collapsed into seven major race/ethnicity groups. By self-report the cohort is 80.8% white and 19.2% minority; 93.8% endorsed a single race/ethnicity group, while 6.2% endorsed two or more. Principal component (PC) and admixture analyses were generally consistent with prior studies. Approximately 17% of subjects had genetic ancestry from more than one continent, and 12% were genetically admixed, considering only nonadjacent geographical origins. Self-reported whites were spread on a continuum along the first two PCs, indicating extensive mixing among European nationalities. Self-identified East Asian nationalities correlated with genetic clustering, consistent with extensive endogamy. Individuals of mixed East Asian-European genetic ancestry were easily identified; we also observed a modest amount of European genetic ancestry in individuals self-identified as Filipinos. Self-reported African Americans and Latinos showed extensive European and African genetic ancestry, and Native American genetic ancestry for the latter. Among 3741 genetically identified parent-child pairs, 93% were concordant for self-reported race/ethnicity; among 2018 genetically identified full-sib pairs, 96% were concordant; the lower rate for parent-child pairs was largely due to intermarriage. The parent-child pairs revealed a trend toward increasing exogamy over time; the presence in the cohort of individuals endorsing multiple race/ethnicity categories creates interesting challenges and future opportunities for genetic epidemiologic studies.


Related Papers

No related papers found

Powered by citation graph analysis