Publishes on Genomics and Chromatin Dynamics, Genetic Associations and Epidemiology, Genomic variations and chromosomal abnormalities. 352 papers and 85.8k citations.
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results. Low read depth sequencing of whole genomes and high read depth exomes of nearly 10,000 extensively phenotyped individuals are combined to help characterize novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with lipid-related traits; in addition to describing population structure and providing functional annotation of rare and low-frequency variants the authors use the data to estimate the benefits of sequencing for association studies. This paper, combining data and initial findings from the different arms of the UK10K project, describes insights from low-read-depth sequencing of whole genomes or high-read-depth exome sequencing of nearly 10,000 individuals sampled from a range of disease collections, as well as participants from healthy population based cohorts. The authors characterize novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with lipid-related traits. In addition to describing population structure and providing functional annotation of rare and low frequency variants, they use the data to estimate the benefits of sequencing for association studies.