Genomic data in the All of Us Research ProgramAbstract Comprehensively mapping the genetic basis of human disease across diverse individuals is a long-standing goal for the field of human genetics 1–4 . The All of Us Research Program is a longitudinal cohort study aiming to enrol a diverse group of at least one million individuals across the USA to accelerate biomedical research and improve human health 5,6 . Here we describe the programme’s genomics data release of 245,388 clinical-grade genome sequences. This resource is unique in its diversity as 77% of participants are from communities that are historically under-represented in biomedical research and 46% are individuals from under-represented racial and ethnic minorities. All of Us identified more than 1 billion genetic variants, including more than 275 million previously unreported genetic variants, more than 3.9 million of which had coding consequences. Leveraging linkage between genomic data and the longitudinal electronic health record, we evaluated 3,724 genetic variants associated with 117 diseases and found high replication rates across both participants of European ancestry and participants of African ancestry. Summary-level data are publicly available, and individual-level data can be accessed by researchers through the All of Us Researcher Workbench using a unique data passport model with a median time from initial researcher registration to data access of 29 hours. We anticipate that this diverse dataset will advance the promise of genomic medicine for all.
Type 2 diabetes: Evidence for linkage on chromosome 20 in 716 Finnish affected sib pairsSoumitra Ghosh, Richard M. Watanabe, Elizabeth R. Hauser et al.|Proceedings of the National Academy of Sciences|1999 We are conducting a genome scan at an average resolution of 10 centimorgans (cM) for type 2 diabetes susceptibility genes in 716 affected sib pairs from 477 Finnish families. To date, our best evidence for linkage is on chromosome 20 with potentially separable peaks located on both the long and short arms. The unweighted multipoint maximum logarithm of odds score (MLS) was 3.08 on 20p (location, chi = 19.5 cM) under an additive model, whereas the weighted MLS was 2.06 on 20q (chi = 57 cM, recurrence risk,lambda(s) = 1. 25, P = 0.009). Weighted logarithm of odds scores of 2.00 (chi = 69.5 cM, P = 0.010) and 1.92 (chi = 18.5 cM, P = 0.013) were also observed. Ordered subset analyses based on sibships with extreme mean values of diabetes-related quantitative traits yielded sets of families who contributed disproportionately to the peaks. Two-hour glucose levels in offspring of diabetic individuals gave a MLS of 2. 12 (P = 0.0018) at 9.5 cM. Evidence from this and other studies suggests at least two diabetes-susceptibility genes on chromosome 20. We have also screened the gene for maturity-onset diabetes of the young 1, hepatic nuclear factor 4-a (HNF-4alpha) in 64 affected sibships with evidence for high chromosomal sharing at its location on chromosome 20q. We found no evidence that sequence changes in this gene accounted for the linkage results we observed.
The Finland–United States Investigation of Non–Insulin-Dependent Diabetes Mellitus Genetics (FUSION) Study. I. An Autosomal Genome Scan for Genes That Predispose to Type 2 DiabetesSoumitra Ghosh, Richard M. Watanabe, Timo T. Valle et al.|The American Journal of Human Genetics|2000 Selection, optimization and validation of ten chronic disease polygenic risk scores for clinical implementation in diverse US populationsPolygenic risk scores (PRSs) have improved in predictive performance, but several challenges remain to be addressed before PRSs can be implemented in the clinic, including reduced predictive performance of PRSs in diverse populations, and the interpretation and communication of genetic results to both providers and patients. To address these challenges, the National Human Genome Research Institute-funded Electronic Medical Records and Genomics (eMERGE) Network has developed a framework and pipeline for return of a PRS-based genome-informed risk assessment to 25,000 diverse adults and children as part of a clinical study. From an initial list of 23 conditions, ten were selected for implementation based on PRS performance, medical actionability and potential clinical utility, including cardiometabolic diseases and cancer. Standardized metrics were considered in the selection process, with additional consideration given to strength of evidence in African and Hispanic populations. We then developed a pipeline for clinical PRS implementation (score transfer to a clinical laboratory, validation and verification of score performance), and used genetic ancestry to calibrate PRS mean and variance, utilizing genetically diverse data from 13,475 participants of the All of Us Research Program cohort to train and test model parameters. Finally, we created a framework for regulatory compliance and developed a PRS clinical report for return to providers and for inclusion in an additional genome-informed risk assessment. The initial experience from eMERGE can inform the approach needed to implement PRS-based testing in diverse clinical settings.
The Finland–United States Investigation of Non–Insulin-Dependent Diabetes Mellitus Genetics (FUSION) Study. II. An Autosomal Genome Scan for Diabetes-Related Quantitative-Trait LociRichard M. Watanabe, Soumitra Ghosh, Carl D. Langefeld et al.|The American Journal of Human Genetics|2000