Three Rounds of Read Correction Significantly Improve Eukaryotic Protein Detection in ONT Reads

Eukaryotes' whole-genome sequencing is crucial for species identification, gene detection, and protein annotation. Oxford Nanopore Technology (ONT) is an affordable and rapid platform for sequencing eukaryotes; however, the relatively higher error rates require computational and bioinformatic e...

Full description

Saved in:
Bibliographic Details
Published inMicroorganisms (Basel) Vol. 12; no. 2; p. 247
Main Authors Safar, Hussain A, Alatar, Fatemah, Mustafa, Abu Salim
Format Journal Article
LanguageEnglish
Published Switzerland MDPI AG 24.01.2024
MDPI
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Eukaryotes' whole-genome sequencing is crucial for species identification, gene detection, and protein annotation. Oxford Nanopore Technology (ONT) is an affordable and rapid platform for sequencing eukaryotes; however, the relatively higher error rates require computational and bioinformatic efforts to produce more accurate genome assemblies. Here, we evaluated the effect of read correction tools on eukaryote genome completeness, gene detection and protein annotation. Reads generated by ONT of four eukaryotes, , , , and , were assembled using minimap2 and underwent three rounds of read correction using flye, medaka and racon. The generates consensus FASTA files were compared for total length (bp), genome completeness, gene detection, and protein-annotation by QUAST, BUSCO, BRAKER1 and InterProScan, respectively. Genome completeness was dependent on the assembly method rather than on the read correction tool; however, medaka performed better than flye and racon. Racon significantly performed better than flye and medaka in gene detection, while both racon and medaka significantly performed better than flye in protein-annotation. We show that three rounds of read correction significantly affect gene detection and protein annotation, which are dependent on assembly quality in preference to assembly completeness.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2076-2607
2076-2607
DOI:10.3390/microorganisms12020247