RedOak: a reference-free and alignment-free structure for indexing a collection of similar genomes

Here we present RedOak, a reference-free and alignment-free software package that allows for the indexing of a large collection of similar genomes. RedOak can also be applied to reads from unassembled genomes, and it provides a nucleotide sequence query function. Our method is about the analysis of...

Full description

Saved in:
Bibliographic Details
Published inJournal of open source software Vol. 7; no. 80; p. 4363
Main Authors Agret, Clément, Chateau, Annie, Droc, Gaetan, Sarah, Gautier, Ruiz, Manuel, Mancheron, Alban
Format Journal Article
LanguageEnglish
Published Open Journals 28.12.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Here we present RedOak, a reference-free and alignment-free software package that allows for the indexing of a large collection of similar genomes. RedOak can also be applied to reads from unassembled genomes, and it provides a nucleotide sequence query function. Our method is about the analysis of complete genomes from the 3000 rice genomes sequencing project, but our indexing structure is generic enough to be used in similar projects. This software is based on a k-mer approach and has been developed to be heavily parallelized and distributed on several nodes of a cluster. The source code of our RedOak algorithm is available at RedOak.
ISSN:2475-9066
2475-9066
DOI:10.21105/joss.04363