CrossMap: a versatile tool for coordinate conversion between genome assemblies

Motivation: Reference genome assemblies are subject to change and refinement from time to time. Generally, researchers need to convert the results that have been analyzed according to old assemblies to newer versions, or vice versa, to facilitate meta-analysis, direct comparison, data integration an...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 30; no. 7; pp. 1006 - 1007
Main Authors Zhao, Hao, Sun, Zhifu, Wang, Jing, Huang, Haojie, Kocher, Jean-Pierre, Wang, Liguo
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.04.2014
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Motivation: Reference genome assemblies are subject to change and refinement from time to time. Generally, researchers need to convert the results that have been analyzed according to old assemblies to newer versions, or vice versa, to facilitate meta-analysis, direct comparison, data integration and visualization. Several useful conversion tools can convert genome interval files in browser extensible data or general feature format, but none have the functionality to convert files in sequence alignment map or BigWig format. This is a significant gap in computational genomics tools, as these formats are the ones most widely used for representing high-throughput sequencing data, such as RNA-seq, chromatin immunoprecipitation sequencing, DNA-seq, etc. Results: Here we developed CrossMap, a versatile and efficient tool for converting genome coordinates between assemblies. CrossMap supports most of the commonly used file formats, including BAM, sequence alignment map, Wiggle, BigWig, browser extensible data, general feature format, gene transfer format and variant call format. Availability and implementation: CrossMap is written in Python and C. Source code and a comprehensive user’s manual are freely available at: http://crossmap.sourceforge.net/. Contact:  Kocher.JeanPierre@mayo.edu or wang.liguo@mayo.edu Supplementary information:  Supplementary data are available at Bioinformatics online.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Associate Editor: John Hancock
ISSN:1367-4803
1367-4811
1367-4811
1460-2059
DOI:10.1093/bioinformatics/btt730