STARRPeaker: uniform processing and accurate identification of STARR-seq active regions

Donghoon Lee; Manman Shi; Jennifer Moran; Martha Wall; Jing Zhang; Jason Liu; Dominic Fitzgerald; Yasuhiro Kyono; Lijia Ma; Kevin P. White; Mark Gerstein

doi:10.1186/s13059-020-02194-x

STARRPeaker: uniform processing and accurate identification of STARR-seq active regions

Donghoon Lee(Yale University), Manman Shi(University of Chicago), Jennifer Moran(University of Chicago), Martha Wall(University of Chicago), Jing Zhang(University of California, Irvine), Jason Liu(Yale University), Dominic Fitzgerald(University of Chicago), Yasuhiro Kyono(University of Chicago), Lijia Ma(Westlake University), Kevin P. White(University of Illinois Chicago), Mark Gerstein(Yale University)

Genome biology

December 1, 2020

10.1186/s13059-020-02194-x

Cited by 71Open Access

Full Text

Abstract

STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content. Furthermore, STARR-seq readout is confounded by RNA secondary structure and thermodynamic stability. To address these potential confounders, we developed a negative binomial regression framework for uniformly processing STARR-seq data, called STARRPeaker. Moreover, to aid our effort, we generated whole-genome STARR-seq data from the HepG2 and K562 human cell lines and applied STARRPeaker to comprehensively and unbiasedly call enhancers in them.

Related Papers

No related papers found

Powered by citation graph analysis