Porechop_ABI: discovering unknown adapters in Oxford Nanopore Technology sequencing reads for downstream trimming

Abstract Motivation Oxford Nanopore Technologies (ONT) sequencing has become very popular over the past few years and offers a cost-effective solution for many genomic and transcriptomic projects. One distinctive feature of the technology is that the protocol includes the ligation of adapters to bot...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics advances Vol. 3; no. 1; p. vbac085
Main Authors Bonenfant, Quentin, Noé, Laurent, Touzet, Hélène
Format Journal Article
LanguageEnglish
Published England Oxford University Press 2023
Oxford academic
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Motivation Oxford Nanopore Technologies (ONT) sequencing has become very popular over the past few years and offers a cost-effective solution for many genomic and transcriptomic projects. One distinctive feature of the technology is that the protocol includes the ligation of adapters to both ends of each fragment. Those adapters should then be removed before downstream analyses, either during the basecalling step or by explicit trimming. This basic task may be tricky when the definition of the adapter sequence is not well documented. Results We have developed a new method to scan a set of ONT reads to see if it contains adapters, without any prior knowledge on the sequence of the potential adapters, and then trim out those adapters. The algorithm is based on approximate k-mers and is able to discover adapter sequences based on their frequency alone. The method was successfully tested on a variety of ONT datasets with different flowcells, sequencing kits and basecallers. Availability and implementation The resulting software, named Porechop_ABI, is open-source and is available at https://github.com/bonsai-team/Porechop_ABI. Supplementary information Supplementary data are available at Bioinformatics advances online.
Bibliography:SourceType-Scholarly Journals-1
content type line 14
ObjectType-Report-1
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
PMCID: PMC9869717
ISSN:2635-0041
2635-0041
DOI:10.1093/bioadv/vbac085