M

Manolis Kellis

Broad Institute

Publishes on Cancer Genomics and Diagnostics, Genomics and Chromatin Dynamics, Epigenetics and DNA Methylation. 21 papers and 10.7k citations.

21Publications
10.7kTotal Citations
#9in Gene Regulation

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Integrative analysis of 111 reference human epigenomes
Cited by 7.1kOpen Access

The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

A high-resolution map of human evolutionary constraint using 29 mammals
Cited by 1.2kOpen Access

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease. This comparative genomics study, comparing the complete human genome sequence with those of 29 placental mammals, including chimpanzees, mice and dogs, identifies 4.2% of the human genome as constrained by evolutionary selection, and ascribes a potential function to about 60% of these constrained bases. A series of evolutionary signatures emerges, providing insights into coding and non-coding functional genomic elements, candidate RNA structural families and aspects of genome organization and evolution. Overlap with disease-associated variants indicates that the findings will be relevant for studies of human disease.

Combined burden and functional impact tests for cancer driver discovery using DriverPower
Cited by 65Open Access

The discovery of driver mutations is one of the key motivations for cancer genome sequencing. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumour types, we describe DriverPower, a software package that uses mutational burden and functional impact evidence to identify driver mutations in coding and non-coding sites within cancer whole genomes. Using a total of 1373 genomic features derived from public sources, DriverPower's background mutation model explains up to 93% of the regional variance in the mutation rate across multiple tumour types. By incorporating functional impact scores, we are able to further increase the accuracy of driver discovery. Testing across a collection of 2583 cancer genomes from the PCAWG project, DriverPower identifies 217 coding and 95 non-coding driver candidates. Comparing to six published methods used by the PCAWG Drivers and Functional Interpretation Working Group, DriverPower has the highest F1 score for both coding and non-coding driver discovery. This demonstrates that DriverPower is an effective framework for computational driver discovery.

Author Correction: Analyses of non-coding somatic drivers in 2,658 cancer whole genomes
Cited by 12Open Access

Correction to: Nature Published online 5 February 2020 In the published version of this paper, the members of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium were listed in the Supplementary Information; however, these members should have been included in the main paper. The original Article has been corrected to include the members and affiliations of the PCAWG Consortium in the main paper; the corrections have been made to the HTML version of the Article but not the PDF version. Additional minor corrections to affiliations have been made to the PDF and HTML versions of the original Article for consistency of information between the PCAWG list and the main paper. An additional affiliation has been added for Husen M. Umer (Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institute, Stockholm, Sweden). The affiliation for Erik Larsson has been changed from Computational Biology Center, Memorial Sloan Kettering Cancer Center, New York, NY, USA to Institute of Biomedicine, Sahlgrenska Academy at University of Gothenburg, Gothenburg, Sweden.

Biophysical characterization of high-confidence, small human proteins
Allison M. Whited, Irwin Jungreis, Jeffre Allen et al.|Biophysical Reports|2024
Cited by 3Open Access

Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode for functional proteins, producing smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to the biophysical properties of these proteins. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association of a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. In addition, we provide an example of how our characterization of biophysical properties can contribute to distinguishing protein-coding smORFs from noncoding ones in otherwise ambiguous cases.

Similar Researchers

Coming soon — researchers in similar fields and career stages