DEGAP: Dynamic elongation of a genome assembly path

Genome assembly remains to be a major task in genomic research. Despite the development over the past decades of different assembly software programs and algorithms, it is still a great challenge to assemble a complete genome without any gaps. With the latest DNA circular consensus sequencing (CCS)...

Full description

Saved in:
Bibliographic Details
Published inBriefings in bioinformatics Vol. 25; no. 3
Main Authors Huang, Yicheng, Wang, Ziyuan, Schmidt, Monica A, Su, Handong, Xiong, Lizhong, Zhang, Jianwei
Format Journal Article
LanguageEnglish
Published England 27.03.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Genome assembly remains to be a major task in genomic research. Despite the development over the past decades of different assembly software programs and algorithms, it is still a great challenge to assemble a complete genome without any gaps. With the latest DNA circular consensus sequencing (CCS) technology, several assembly programs can now build a genome from raw sequencing data to contigs; however, some complex sequence regions remain as unresolved gaps. Here, we present a novel gap-filling software, DEGAP (Dynamic Elongation of a Genome Assembly Path), that resolves gap regions by utilizing the dual advantages of accuracy and length of high-fidelity (HiFi) reads. DEGAP identifies differences between reads and provides 'GapFiller' or 'CtgLinker' modes to eliminate or shorten gaps in genomes. DEGAP adopts an iterative elongation strategy that automatically and dynamically adjusts parameters according to three complexity factors affecting the genome to determine the optimal extension path. DEGAP has already been successfully applied to decipher complex genomic regions in several projects and may be widely employed to generate more gap-free genomes.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1467-5463
1477-4054
DOI:10.1093/bib/bbae194