High Performance computing improvements on bioinformatics consistency-based multiple sequence alignment tools
•BGT method improves the execution time of progressive alignment by 62%.•OLM decreases the memory requirements by 75%, aligning up twice more sequences than previous method.•MTA improves the alignments accuracy, letting it to regain the quality lost due to the performance improvements. Multiple Sequ...
Saved in:
Published in | Parallel computing Vol. 42; pp. 18 - 34 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
01.02.2015
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | •BGT method improves the execution time of progressive alignment by 62%.•OLM decreases the memory requirements by 75%, aligning up twice more sequences than previous method.•MTA improves the alignments accuracy, letting it to regain the quality lost due to the performance improvements.
Multiple Sequence Alignment (MSA) is essential for a wide range of applications in Bioinformatics. Traditionally, the alignment accuracy was the main metric used to evaluate the goodness of MSA tools. However, with the growth of sequencing data, other features, such as performance and the capacity to align larger datasets, are gaining strength. To achieve these new requirements, without affecting accuracy, the use of high-performance computing (HPC) resources and techniques is crucial. In this paper, we apply HPC techniques in T-Coffee, one of the more accurate but less scalable MSA tools. We integrate three innovative solutions into T-Coffee: the Balanced Guide Tree to increase the parallelism/performance, the Optimized Library Method with the aim of enhancing the scalability and the Multiple Tree Alignment, which explores different alignments in parallel to improve the accuracy. The results obtained show that the resulting tool, MTA-TCoffee, is able to improve the scalability in both the execution time and also the number of sequences to be aligned. Furthermore, not only is the alignment accuracy not affected by these improvements, as would be expected, but it improves significantly. Finally, we emphasize that the presented methods are not just restricted to T-Coffee, but may be implemented in any other alignment tools that use similar algorithms (progressive alignment, consistency or guide trees). |
---|---|
ISSN: | 0167-8191 1872-7336 |
DOI: | 10.1016/j.parco.2014.09.010 |