Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA
Published in | AIP conference proceedings Vol. 1862; no. 1 |
---|---|
Main Authors | Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G. |
Format | Journal Article; Conference Proceeding |
Language | English |
Published | Melville: American Institute of Physics, 10 July 2017 |
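Subjects | Alignment; Codes; Deoxyribonucleic acid; DNA; Gene sequencing; Medical research; Nucleotides; Permutations; Quaternions; Representations; Streptococcus infections; Thymine |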
Online Access | https://aip.scitation.org/doi/pdf/10.1063/1.4991226 |
ISSN | 0094-243X (print); 1551-7616 (electronic) |
DOI | 10.1063/1.4991226 |
Abstract | A DNA sequence can be defined as a succession of letters representing the order of nucleotides within DNA, written with the four base codes adenine (A), guanine (G), cytosine (C), and thymine (T). The precise sequence is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and have now become advanced, high-throughput technologies. DNA sequencing has greatly accelerated biological and medical research and discovery. In some cases, however, sequencing produces ambiguous results, making it difficult to determine whether a given code is A, T, G, or C. To address this problem, this study introduces another representation of DNA codes, the Quaternion $Q = (P_A, P_T, P_G, P_C)$, where $P_A$, $P_T$, $P_G$, $P_C$ are the probabilities that the bases A, T, G, C appear in $Q$, and $P_A + P_T + P_G + P_C = 1$. Using Quaternion representations, we construct an improved scoring matrix for global sequence alignment by applying a dot product method. This scoring matrix produces higher-quality match and mismatch scores between two DNA base codes. In the implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave to analyze our target sequence, which contains some ambiguous sequence data. The subject sequences are DNA sequences of Streptococcus pneumoniae families obtained from GenBank, while the target DNA sequence was received from our collaborator's database. We found that the Quaternion representations improve the quality of the sequence alignment score, and we conclude that the target DNA sequence has maximum similarity with Streptococcus pneumoniae. |
---|---|
Author | Lestari, D.; Bustamam, A.; Ardaneswari, G.; Novianti, T. |
Author_xml | – sequence: 1 givenname: D. surname: Lestari fullname: Lestari, D. organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia – sequence: 2 givenname: A. surname: Bustamam fullname: Bustamam, A. email: alhadi@sci.ui.ac.id organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia – sequence: 3 givenname: T. surname: Novianti fullname: Novianti, T. organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia – sequence: 4 givenname: G. surname: Ardaneswari fullname: Ardaneswari, G. organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia |
CODEN | APCPCS |
Copyright | 2017 Author(s). Published by AIP Publishing. |
Discipline | Physics |
EISSN | 1551-7616 |
Editor | Mart, Terry; Triyono, Djoko; Sugeng, Kiki A. (all Universitas Indonesia) |
Genre | Conference Proceeding |
ISSN | 0094-243X |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
License | 0094-243X/2017/1862/030122/7/$30.00 Published by AIP Publishing. |
MeetingName | INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2016 (ISCPMS 2016): Proceedings of the 2nd International Symposium on Current Progress in Mathematics and Sciences 2016 |
OpenAccessLink | https://aip.scitation.org/doi/pdf/10.1063/1.4991226 |
PageCount | 7 |
PublicationDate | 2017-07-10 |
PublicationPlace | Melville |
PublicationTitle | AIP conference proceedings |
PublicationYear | 2017 |
Publisher | American Institute of Physics |
SubjectTerms | Alignment; Codes; Deoxyribonucleic acid; DNA; Gene sequencing; Medical research; Nucleotides; Permutations; Quaternions; Representations; Streptococcus infections; Thymine |
URI | http://dx.doi.org/10.1063/1.4991226 https://www.proquest.com/docview/2116094707 |
Volume | 1862 |
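
The abstract describes representing each base as a probability vector over (A, T, G, C), scoring pairs of bases by a dot product, and feeding those scores into Needleman-Wunsch global alignment. The following is a minimal sketch of that idea, not the authors' Octave implementation. Several details are assumptions made only for illustration: the rescaling of the dot product to a score between -1 and +1, the linear gap penalty of -1, the uniform probabilities assigned to the ambiguity codes R and N, and all function names.

```python
# Probability vectors in the order (P_A, P_T, P_G, P_C) used in the abstract.
BASES = {
    "A": (1.0, 0.0, 0.0, 0.0),
    "T": (0.0, 1.0, 0.0, 0.0),
    "G": (0.0, 0.0, 1.0, 0.0),
    "C": (0.0, 0.0, 0.0, 1.0),
    # Ambiguous codes: uniform weights over candidate bases (an assumption).
    "R": (0.5, 0.0, 0.5, 0.0),      # A or G
    "N": (0.25, 0.25, 0.25, 0.25),  # any base
}


def dot(p, q):
    """Dot product of two 4-component probability vectors."""
    return sum(x * y for x, y in zip(p, q))


def pair_score(a, b):
    """Score two base codes from the dot product of their vectors.

    Assumed rescaling (not from the paper): a dot product of 1 maps to +1
    (certain match) and a dot product of 0 maps to -1 (certain mismatch).
    """
    return 2.0 * dot(BASES[a], BASES[b]) - 1.0


def needleman_wunsch_score(s, t, gap=-1.0):
    """Return the optimal global alignment score of s and t.

    Standard Needleman-Wunsch dynamic programming with a linear gap
    penalty; the gap value is a hypothetical default.
    """
    rows, cols = len(s) + 1, len(t) + 1
    table = [[0.0] * cols for _ in range(rows)]
    # Aligning a prefix against the empty string costs one gap per symbol.
    for i in range(1, rows):
        table[i][0] = i * gap
    for j in range(1, cols):
        table[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            table[i][j] = max(
                table[i - 1][j - 1] + pair_score(s[i - 1], t[j - 1]),  # align
                table[i - 1][j] + gap,  # gap in t
                table[i][j - 1] + gap,  # gap in s
            )
    return table[-1][-1]
```

Under these assumptions an ambiguous position contributes a partial score instead of a full mismatch, so a call such as `needleman_wunsch_score("GATNACA", "GATTACA")` scores between a perfect match and an alignment with a definite mismatch at that position.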