TY - JOUR
T1 - CrossMap
T2 - A versatile tool for coordinate conversion between genome assemblies
AU - Zhao, Hao
AU - Sun, Zhifu
AU - Wang, Jing
AU - Huang, Haojie
AU - Kocher, Jean Pierre
AU - Wang, Liguo
N1 - Funding Information: This work was funded by the Bioinformatics Program, Center for Individualized Medicine of Mayo Clinic [C4331304 to K.JP., W.L.]; NIH Cancer Center Support Grant (P30 CA016672), Bioinformatics Shared Resources [PP-SR22 to W.J., Z.H.].
PY - 2014/4/1
Y1 - 2014/4/1
N2 - Motivation: Reference genome assemblies are subject to change and refinement from time to time. Generally, researchers need to convert the results that have been analyzed according to old assemblies to newer versions, or vice versa, to facilitate meta-analysis, direct comparison, data integration and visualization. Several useful conversion tools can convert genome interval files in browser extensible data or general feature format, but none have the functionality to convert files in sequence alignment map or BigWig format. This is a significant gap in computational genomics tools, as these formats are the ones most widely used for representing high-throughput sequencing data, such as RNA-seq, chromatin immunoprecipitation sequencing, DNA-seq, etc. Results: Here we developed CrossMap, a versatile and efficient tool for converting genome coordinates between assemblies. CrossMap supports most of the commonly used file formats, including BAM, sequence alignment map, Wiggle, BigWig, browser extensible data, general feature format, gene transfer format and variant call format.
UR - http://www.scopus.com/inward/record.url?scp=84897373336&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84897373336&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btt730
DO - 10.1093/bioinformatics/btt730
M3 - Article
C2 - 24351709
AN - SCOPUS:84897373336
SN - 1367-4803
VL - 30
SP - 1006
EP - 1007
JO - Bioinformatics
JF - Bioinformatics
IS - 7
ER -