An Efficient Parallel Algorithm for Multiple Sequence Similarities Calculation Using a Low Complexity Method

Bibliographic Details
Published in BioMed Research International Vol. 2014; no. 2014; pp. 1-6
Main Authors Marucci, Evandro A., Zafalon, Geraldo F. D., Momente, Julio C., Neves, Leandro A., Valêncio, Carlo R., Pinto, Alex R., Cansian, Adriano M., de Souza, Rogeria C. G., Shiyou, Yang, Machado, José M.
Format Journal Article
Language English
Published Cairo, Egypt Hindawi Publishing Corporation 01.01.2014
Hindawi Publishing Corporation
Hindawi Limited
Summary: With advances in genomic research, the number of sequences involved in comparative methods has grown immensely. Among these are methods for similarity calculation, which are used by many bioinformatics applications. Due to the huge amount of data, combining low-complexity methods with parallel computing is becoming desirable. k-mer counting is a very efficient method with good biological results. This work proposes a parallel algorithm for multiple sequence similarity calculation using the k-mer counting method. Tests show that the algorithm scales very well and achieves a nearly linear speedup: a 12x speedup was obtained on 14 nodes. The algorithm can be used to parallelize some multiple sequence alignment tools, such as MAFFT and MUSCLE.
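
The idea in the summary can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes one common normalization (shared k-mers divided by the number of k-mers in the shorter sequence), and a thread pool stands in for the cluster nodes used in the paper. The names `kmer_counts`, `kmer_similarity`, and `similarity_matrix` are hypothetical.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations


def kmer_counts(seq, k):
    """Count every overlapping substring of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_similarity(counts_a, counts_b, len_a, len_b, k):
    """Shared k-mers divided by the k-mer count of the shorter sequence.

    Assumption: this normalization is one common choice, not necessarily
    the one used in the paper.
    """
    shared = sum((counts_a & counts_b).values())  # multiset intersection
    denom = min(len_a, len_b) - k + 1
    return shared / denom if denom > 0 else 0.0


def similarity_matrix(seqs, k=3, workers=4):
    """Compute all pairwise similarities, spreading the pairs over workers."""
    n = len(seqs)
    counts = [kmer_counts(s, k) for s in seqs]  # each sequence is counted once
    pairs = list(combinations(range(n), 2))     # independent units of work

    def score(pair):
        i, j = pair
        return kmer_similarity(counts[i], counts[j], len(seqs[i]), len(seqs[j]), k)

    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        matrix[i][i] = 1.0  # a sequence is treated as identical to itself
    # Threads are a stand-in for nodes; the pairs do not depend on each other.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for (i, j), value in zip(pairs, pool.map(score, pairs)):
            matrix[i][j] = matrix[j][i] = value
    return matrix
```

Because each pair is scored independently, the list of pairs can be split among workers without any communication until the results are gathered, which is the property that makes a near-linear speedup plausible. In CPython, threads will not speed up this CPU-bound work; a process pool or a message-passing library would be needed for real gains.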
Academic Editor: Tzong-Yi Lee
ISSN: 2314-6133, 2314-6141
DOI: 10.1155/2014/563016