H

Harold Pimentel

University of California, Los Angeles

ORCID: 0000-0001-8556-2499

Publishes on Genomics and Phylogenetic Studies, RNA modifications and cancer, Gene expression and cancer classification. 62 papers and 42.2k citations.

62Publications
42.2kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions
Daehwan Kim, Geo Pertea, Cole Trapnell et al.|Genome biology|2013
Cited by 13.6kOpen Access

TopHat is a popular spliced aligner for RNA-sequence (RNA-seq) experiments. In this paper, we describe TopHat2, which incorporates many significant enhancements to TopHat. TopHat2 can align reads of various lengths produced by the latest sequencing technologies, while allowing for variable-length indels with respect to the reference genome. In addition to de novo spliced alignment, TopHat2 can align reads across fusion breaks, which can occur after genomic translocations. TopHat2 combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes. TopHat2 is available at http://ccb.jhu.edu/software/tophat.

Identification of novel transcripts in annotated genomes using RNA-Seq
Adam Roberts, Harold Pimentel, Cole Trapnell et al.|Bioinformatics|2011
Cited by 1kOpen Access

SUMMARY: We describe a new 'reference annotation based transcript assembly' problem for RNA-Seq data that involves assembling novel transcripts in the context of an existing annotation. This problem arises in the analysis of expression in model organisms, where it is desirable to leverage existing annotations for discovering novel transcripts. We present an algorithm for reference annotation-based transcript assembly and show how it can be used to rapidly investigate novel transcripts revealed by RNA-Seq in comparison with a reference annotation. AVAILABILITY: The methods described in this article are implemented in the Cufflinks suite of software for RNA-Seq, freely available from http://bio.math.berkeley.edu/cufflinks. The software is released under the BOOST license. CONTACT: cole@broadinstitute.org; lpachter@math.berkeley.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.