On DNA numerical representations for genomic similarity computation
Genomic signal processing (GSP) refers to the use of signal processing for the analysis of genomic data. GSP methods require the transformation or mapping of the genomic data to a numeric representation. To date, several DNA numeric representations (DNR) have been proposed; however, it is not clear...
Saved in:
Published in | PloS one Vol. 12; no. 3; p. e0173288 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
United States
Public Library of Science
21.03.2017
Public Library of Science (PLoS) |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Genomic signal processing (GSP) refers to the use of signal processing for the analysis of genomic data. GSP methods require the transformation or mapping of the genomic data to a numeric representation. To date, several DNA numeric representations (DNR) have been proposed; however, it is not clear what the properties of each DNR are and how the selection of one will affect the results when using a signal processing technique to analyze them. In this paper, we present an experimental study of the characteristics of nine of the most frequently-used DNR. The objective of this paper is to evaluate the behavior of each representation when used to measure the similarity of a given pair of DNA sequences. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Competing Interests: The authors have declared that no competing interests exist. Conceptualization: GM IR ST RS JAM.Data curation: IR ST.Formal analysis: GM IR ST RS.Investigation: GM IR ST RS JAM.Methodology: GM JAM.Project administration: JAM.Software: GM IR ST RS.Validation: JAM.Visualization: ST JAM.Writing – original draft: GM IR ST RS JAM.Writing – review & editing: GM IR ST RS JAM. |
ISSN: | 1932-6203 1932-6203 |
DOI: | 10.1371/journal.pone.0173288 |