CrossMap: a versatile tool for coordinate conversion between genome assemblies

Hao Zhao(Mayo Clinic), Zhifu Sun(Mayo Clinic), Jing Wang(Mayo Clinic), Haojie Huang(Mayo Clinic), Jean‐Pierre Kocher(Mayo Clinic), Liguo Wang(Mayo Clinic)
Bioinformatics
December 18, 2013
Cited by 859Open Access
Full Text

Abstract

MOTIVATION: Reference genome assemblies are subject to change and refinement from time to time. Generally, researchers need to convert the results that have been analyzed according to old assemblies to newer versions, or vice versa, to facilitate meta-analysis, direct comparison, data integration and visualization. Several useful conversion tools can convert genome interval files in browser extensible data or general feature format, but none have the functionality to convert files in sequence alignment map or BigWig format. This is a significant gap in computational genomics tools, as these formats are the ones most widely used for representing high-throughput sequencing data, such as RNA-seq, chromatin immunoprecipitation sequencing, DNA-seq, etc. RESULTS: Here we developed CrossMap, a versatile and efficient tool for converting genome coordinates between assemblies. CrossMap supports most of the commonly used file formats, including BAM, sequence alignment map, Wiggle, BigWig, browser extensible data, general feature format, gene transfer format and variant call format. AVAILABILITY AND IMPLEMENTATION: CrossMap is written in Python and C. Source code and a comprehensive user's manual are freely available at: http://crossmap.sourceforge.net/.


Related Papers

No related papers found

Powered by citation graph analysis