Kristen M. Connolly

Broad Institute

Publishes on Genomics and Rare Diseases, Genomics and Phylogenetic Studies, Genomic variations and chromosomal abnormalities. 24 papers and 15.8k citations.

Top publications by citations

The mutational constraint spectrum quantified from variation in 141,456 humans
Cited by 10k | Open Access

Abstract Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes [1]. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.

The mutational constraint spectrum quantified from variation in 141,456 humans
Konrad J. Karczewski, Laurent C. Francioli, Grace Tiao et al. | bioRxiv (Cold Spring Harbor Laboratory) | 2019
Cited by 1.8k | Open Access

Summary Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes critical for an organism’s function will be depleted for such variants in natural populations, while non-essential genes will tolerate their accumulation. However, predicted loss-of-function (pLoF) variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes [1]. Here, we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence pLoF variants in this cohort after filtering for sequencing and annotation artifacts. Using an improved human mutation rate model, we classify human protein-coding genes along a spectrum representing tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve gene discovery power for both common and rare diseases.

A structural variation reference for medical and population genetics
Cited by 1.2k | Open Access

Abstract Structural variants (SVs) rearrange large segments of DNA [1] and can have profound consequences in evolution and human disease [2,3]. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD) [4] have become integral in the interpretation of single-nucleotide variants (SNVs) [5]. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25–29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage [6]. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings [7]. This SV resource is freely distributed via the gnomAD browser [8] and will have broad utility in population genetics, disease-association studies, and diagnostic screening.

Genetic Diversity and Protective Efficacy of the RTS,S/AS01 Malaria Vaccine
Daniel E. Neafsey, Michal Juraska, Trevor Bedford et al. | New England Journal of Medicine | 2015
Cited by 450 | Open Access

BACKGROUND: The RTS,S/AS01 vaccine targets the circumsporozoite protein of Plasmodium falciparum and has partial protective efficacy against clinical and severe malaria disease in infants and children. We investigated whether the vaccine efficacy was specific to certain parasite genotypes at the circumsporozoite protein locus. METHODS: We used polymerase chain reaction-based next-generation sequencing of DNA extracted from samples from 4985 participants to survey circumsporozoite protein polymorphisms. We evaluated the effect that polymorphic positions and haplotypic regions within the circumsporozoite protein had on vaccine efficacy against first episodes of clinical malaria within 1 year after vaccination. RESULTS: In the per-protocol group of 4577 RTS,S/AS01-vaccinated participants and 2335 control-vaccinated participants who were 5 to 17 months of age, the 1-year cumulative vaccine efficacy was 50.3% (95% confidence interval [CI], 34.6 to 62.3) against clinical malaria in which parasites matched the vaccine in the entire circumsporozoite protein C-terminal (139 infections), as compared with 33.4% (95% CI, 29.3 to 37.2) against mismatched malaria (1951 infections) (P=0.04 for differential vaccine efficacy). The vaccine efficacy based on the hazard ratio was 62.7% (95% CI, 51.6 to 71.3) against matched infections versus 54.2% (95% CI, 49.9 to 58.1) against mismatched infections (P=0.06). In the group of infants 6 to 12 weeks of age, there was no evidence of differential allele-specific vaccine efficacy. CONCLUSIONS: These results suggest that among children 5 to 17 months of age, the RTS,S vaccine has greater activity against malaria parasites with the matched circumsporozoite protein allele than against mismatched malaria. The overall vaccine efficacy in this age category will depend on the proportion of matched alleles in the local parasite population; in this trial, less than 10% of parasites had matched alleles. 
(Funded by the National Institutes of Health and others.)