Applications of Machine Learning Tools in Genomics: A Review

To overcome the challenges presented by the manual analysis of large datasets inherent to DNA sequences, machine learning (ML) tools are commonly employed due to their relative accuracy and ease of implementation. However, no consensus exists regarding the most broadly applicable and effective machi...

Full description

Saved in:
Bibliographic Details
Published inSmart Computing and Communication Vol. 11910; pp. 330 - 340
Main Authors Fracasso, Joseph L., Ali, Md Liakat
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2019
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:To overcome the challenges presented by the manual analysis of large datasets inherent to DNA sequences, machine learning (ML) tools are commonly employed due to their relative accuracy and ease of implementation. However, no consensus exists regarding the most broadly applicable and effective machine learning tool for performing multiple analysis on DNA sequences, with many researchers instead opting to utilize proprietary hybrids. Review determined that the modal techniques among the literature surveyed were support vector machines and neural networks, both existing in modified proprietary forms to best fit DNA sequence analysis. Analyses were principally focused on site specific activities which were then used to create inferences regarding the whole of the molecule. These findings suggest neural networks and support vector machines as verified by Bayesian statistics may be the optimal approach for analyzing long DNA sequences.
ISBN:3030341380
9783030341381
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-030-34139-8_33