Broad Institute
ORCID: 0000-0003-4152-5471Publishes on Genomics and Rare Diseases, Genomic variations and chromosomal abnormalities, Autism Spectrum Disorder Research. 81 papers and 5.2k citations.
Add your photo, update your bio, and get notified when your ranking changes.
Abstract Structural variants (SVs) rearrange large segments of DNA 1 and can have profound consequences in evolution and human disease 2,3 . As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD) 4 have become integral in the interpretation of single-nucleotide variants (SNVs) 5 . However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25–29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage 6 . We also uncovered modest selection against noncoding SVs in cis -regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings 7 . This SV resource is freely distributed via the gnomAD browser 8 and will have broad utility in population genetics, disease-association studies, and diagnostic screening.
In yeast, SPT3 is a component of the multiprotein SPT-ADA-GCN5 acetyltransferase (SAGA) complex that integrates proteins with transcription coactivator/adaptor functions (ADAs and GCN5), histone acetyltransferase activity (GCN5), and core promoter-selective functions (SPTs) involving interactions with the TATA-binding protein (TBP). In particular, yeast SPT3 has been shown to interact directly with TBP. Here we report the molecular cloning of a cDNA encoding a human homologue of yeast SPT3. Amino acid sequence comparisons between human SPT3 (hSPT3) and its counterparts in different yeast species reveal three highly conserved domains, with the most conserved 92-amino acid N-terminal domain being 25% identical with human TAFII18. Despite the significant sequence similarity with TAFII18, native hSPT3 is not a bona fide TAFII because it is not associated in vivo either with human TBP/TFIID or with a TFIID-related TBP-free TAFII complex. However, we present evidence that hSPT3 is associated in vivo with TAFII31 and the recently described longer form of human GCN5 (hGCN5-L) in a novel human complex that has histone acetyltransferase activity. We propose that the human SPT3-TAFII31-GCN5-L acetyltransferase (STAGA) complex is a likely homologue of the yeast SAGA complex.