NextPolish: a fast and efficient genome polishing tool for long-read assembly
Abstract Motivation Although long-read sequencing technologies can produce genomes with long contiguity, they suffer from high error rates. Thus, we developed NextPolish, a tool that efficiently corrects sequence errors in genomes assembled with long reads. This new tool consists of two interlinked...
Saved in:
Published in | Bioinformatics Vol. 36; no. 7; pp. 2253 - 2255 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
England
Oxford University Press
01.04.2020
|
Online Access | Get full text |
Cover
Loading…
Summary: | Abstract
Motivation
Although long-read sequencing technologies can produce genomes with long contiguity, they suffer from high error rates. Thus, we developed NextPolish, a tool that efficiently corrects sequence errors in genomes assembled with long reads. This new tool consists of two interlinked modules that are designed to score and count K-mers from high quality short reads, and to polish genome assemblies containing large numbers of base errors.
Results
When evaluated for the speed and efficiency using human and a plant (Arabidopsis thaliana) genomes, NextPolish outperformed Pilon by correcting sequence errors faster, and with a higher correction accuracy.
Availability and implementation
NextPolish is implemented in C and Python. The source code is available from https://github.com/Nextomics/NextPolish.
Supplementary information
Supplementary data are available at Bioinformatics online. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1367-4803 1367-4811 1460-2059 1367-4811 |
DOI: | 10.1093/bioinformatics/btz891 |