A high-resolution map of human evolutionary constraint using 29 mammals

Kerstin Lindblad‐Toh(Broad Institute), Manuel Garber(Broad Institute), Or Zuk(Massachusetts Institute of Technology), Michael F. Lin(Broad Institute), Brian J. Parker(University of Copenhagen), Stefan Washietl(Vassar College), Pouya Kheradpour(Massachusetts Institute of Technology), Jason Ernst(Massachusetts Institute of Technology), Gregory E. Jordan(Wellcome Trust), Evan Mauceli(Broad Institute), Lucas D. Ward(Broad Institute), Craig B. Lowe(Howard Hughes Medical Institute), Alisha K. Holloway(Gladstone Institutes), Michèle Clamp(Biofuel Research Team), Sante Gnerre(Broad Institute), Jessica Alföldi(Broad Institute), Kathryn Beal(Wellcome Trust), Jean L. Chang(Massachusetts Institute of Technology), Hiram Clawson(University of California, Santa Cruz), James Cuff(Harvard University), Federica Di Palma(Broad Institute), Stephen Fitzgerald(European Bioinformatics Institute), Paul Flicek(Wellcome Trust), Mitchell Guttman(Massachusetts Institute of Technology), Melissa J. Hubisz(Cornell University), David B. Jaffe(Massachusetts Institute of Technology), Irwin Jungreis(Vassar College), W. James Kent(Gladstone Institutes), Dennis Kostka(Gladstone Institutes), Marcia Lara(Broad Institute), André L. Martins(Cornell University), Tim Massingham(European Bioinformatics Institute), Ida Moltke(University of Copenhagen), Brian J. Raney(University of California, Santa Cruz), Matthew D. Rasmussen(Vassar College), Jim Robinson(Massachusetts Institute of Technology), Alexander Stark(Research Institute of Molecular Pathology), Albert J. Vilella(European Bioinformatics Institute), Jiayu Wen(University of Copenhagen), Xiaohui Xie(Broad Institute), Michael C. Zody(Broad Institute), Kim C. Worley(Baylor College of Medicine), Christie Kovar(Baylor College of Medicine), Donna M. Muzny(Baylor College of Medicine), Richard A. Gibbs(Baylor College of Medicine), Wesley C. Warren(Washington University in St. Louis), Elaine R. Mardis(Washington University in St. Louis), George M. Weinstock(Washington University in St. Louis), Richard K. Wilson(Washington University in St. Louis), Ewan Birney(Wellcome Trust), Elliott H. Margulies(National Human Genome Research Institute), Javier Herrero(European Bioinformatics Institute), Eric D. Green(National Human Genome Research Institute), David Haussler(University of California, Santa Cruz), Adam Siepel(Cornell University), Nick Goldman(Wellcome Trust), Katherine S. Pollard(Gladstone Institutes), Jakob Skou Pedersen(Aarhus University Hospital), Eric S. Lander(Massachusetts Institute of Technology), Manolis Kellis(Vassar College)
Nature
October 1, 2011
Cited by 1,210Open Access
Full Text

Abstract

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease. This comparative genomics study, comparing the complete human genome sequence with those of 29 placental mammals, including chimpanzees, mice and dogs, identifies 4.2% of the human genome as constrained by evolutionary selection, and ascribes a potential function to about 60% of these constrained bases. A series of evolutionary signatures emerges, providing insights into coding and non-coding functional genomic elements, candidate RNA structural families and aspects of genome organization and evolution. Overlap with disease-associated variants indicates that the findings will be relevant for studies of human disease.


Related Papers

No related papers found

Powered by citation graph analysis