Insights into human genetic variation and population history from 929 diverse genomesGenome sequences from diverse human groups are needed to understand the structure of genetic variation in our species and the history of, and relationships between, different populations. We present 929 high-coverage genome sequences from 54 diverse human populations, 26 of which are physically phased using linked-read sequencing. Analyses of these genomes reveal an excess of previously undocumented common genetic variation private to southern Africa, central Africa, Oceania, and the Americas, but an absence of such variants fixed between major geographical regions. We also find deep and gradual population separations within Africa, contrasting population size histories between hunter-gatherer and agriculturalist groups in the past 10,000 years, and a contrast between single Neanderthal but multiple Denisovan source populations contributing to present-day human populations.
Dense fine-mapping study identifies new susceptibility loci for primary biliary cirrhosisInsights into human genetic variation and population history from 929 diverse genomesAnders Bergström, Shane McCarthy, Ruoyun Hui et al.|bioRxiv (Cold Spring Harbor Laboratory)|2019 Abstract Genome sequences from diverse human groups are needed to understand the structure of genetic variation in our species and the history of, and relationships between, different populations. We present 929 high-coverage genome sequences from 54 diverse human populations, 26 of which are physically phased using linked-read sequencing. Analyses of these genomes reveal an excess of previously undocumented private genetic variation in southern and central Africa and in Oceania and the Americas, but an absence of fixed, private variants between major geographical regions. We also find deep and gradual population separations within Africa, contrasting population size histories between hunter-gatherer and agriculturalist groups in the last 10,000 years, a potentially major population growth episode after the peopling of the Americas, and a contrast between single Neanderthal but multiple Denisovan source populations contributing to present-day human populations. We also demonstrate benefits to the study of population relationships of genome sequences over ascertained array genotypes. These genome sequences are freely available as a resource with no access or analysis restrictions.
Global Public Perceptions of Genomic Data Sharing: What Shapes the Willingness to Donate DNA and Health Data?Anna Middleton, Richard Milne, Mohamed A. Almarri et al.|The American Journal of Human Genetics|2020 Analyzing genomic data across populations is central to understanding the role of genetic factors in health and disease. Successful data sharing relies on public support, which requires attention to whether people around the world are willing to donate their data that are then subsequently shared with others for research. However, studies of such public perceptions are geographically limited and do not enable comparison. This paper presents results from a very large public survey on attitudes toward genomic data sharing. Data from 36,268 individuals across 22 countries (gathered in 15 languages) are presented. In general, publics across the world do not appear to be aware of, nor familiar with, the concepts of DNA, genetics, and genomics. Willingness to donate one's DNA and health data for research is relatively low, and trust in the process of data's being shared with multiple users (e.g., doctors, researchers, governments) is also low. Participants were most willing to donate DNA or health information for research when the recipient was specified as a medical doctor and least willing to donate when the recipient was a for-profit researcher. Those who were familiar with genetics and who were trusting of the users asking for data were more likely to be willing to donate. However, less than half of participants trusted more than one potential user of data, although this varied across countries. Genetic information was not uniformly seen as different from other forms of health information, but there was an association between seeing genetic information as special in some way compared to other health data and increased willingness to donate. The global perspective provided by our "Your DNA, Your Say" study is valuable for informing the development of international policy and practice for sharing genomic data. It highlights that the research community not only needs to be worthy of trust by the public, but also urgent steps need to be taken to authentically communicate why genomic research is necessary and how data donation, and subsequent sharing, is integral to this.
Population Structure, Stratification, and Introgression of Human Structural VariationStructural variants contribute substantially to genetic diversity and are important evolutionarily and medically, but they are still understudied. Here we present a comprehensive analysis of structural variation in the Human Genome Diversity panel, a high-coverage dataset of 911 samples from 54 diverse worldwide populations. We identify, in total, 126,018 variants, 78% of which were not identified in previous global sequencing projects. Some reach high frequency and are private to continental groups or even individual populations, including regionally restricted runaway duplications and putatively introgressed variants from archaic hominins. By de novo assembly of 25 genomes using linked-read sequencing, we discover 1,643 breakpoint-resolved unique insertions, in aggregate accounting for 1.9 Mb of sequence absent from the GRCh38 reference. Our results illustrate the limitation of a single human reference and the need for high-quality genomes from diverse populations to fully discover and understand human genetic variation.