Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and tec...

Full description

Saved in:
Bibliographic Details
Published inAIP conference proceedings Vol. 1862; no. 1
Main Authors Lestari, D., Bustamam, A., Novianti, T., Ardaneswari, G.
Format Journal Article Conference Proceeding
LanguageEnglish
Published Melville American Institute of Physics 10.07.2017
Subjects
Online AccessGet full text
ISSN0094-243X
1551-7616
DOI10.1063/1.4991226

Cover

Loading…
Abstract DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA , PT , PG , PC ), where PA , PT , PG , PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.
AbstractList DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.
DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA , PT , PG , PC ), where PA , PT , PG , PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.
Author Lestari, D.
Bustamam, A.
Ardaneswari, G.
Novianti, T.
Author_xml – sequence: 1
  givenname: D.
  surname: Lestari
  fullname: Lestari, D.
  organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia
– sequence: 2
  givenname: A.
  surname: Bustamam
  fullname: Bustamam, A.
  email: alhadi@sci.ui.ac.id
  organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia
– sequence: 3
  givenname: T.
  surname: Novianti
  fullname: Novianti, T.
  organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia
– sequence: 4
  givenname: G.
  surname: Ardaneswari
  fullname: Ardaneswari, G.
  organization: Department of Mathematics, Faculty of Mathematics and Natural Sciences (FMIPA), Universitas Indonesia, Depok 16424, Indonesia
BookMark eNp9kM1KxDAQgIMouKsefIOAN6GaSZu0PS7-gyiigreSZqc1S5t0k1TwSXxdu7rgTRgYhvmYn29Odq2zSMgxsDNgMj2Hs6wsgXO5Q2YgBCS5BLlLZoyVWcKz9G2fzENYMcbLPC9m5GsxDJ3RKhpnqWvo06gierupzBT94N2HsS2N70jXo-pM_Nxgbedq1dGA6xGtRjo1WtujjTRo5zHQxnmqLFV9bdrRjeEPjcq3GDfTn6PHITrttJ6AweLYO2sU0suHxSHZa1QX8GibD8jr9dXLxW1y_3hzd7G4TwZeFDFJhSigLGVdQC7LRoBq0hpyrVkhsMzqVIqmAZVyJkS-zFBzoTkuQaJSPANID8jJ79zp0enAEKuVG72dVlYcQE7WcpZP1OkvFbSJP66qwZte-c8KWLURX0G1Ff8f_OH8H1gNyyb9Bq-4iIY
CODEN APCPCS
ContentType Journal Article
Conference Proceeding
Copyright Author(s)
2017 Author(s). Published by AIP Publishing.
Copyright_xml – notice: Author(s)
– notice: 2017 Author(s). Published by AIP Publishing.
DBID 8FD
H8D
L7M
DOI 10.1063/1.4991226
DatabaseName Technology Research Database
Aerospace Database
Advanced Technologies Database with Aerospace
DatabaseTitle Technology Research Database
Aerospace Database
Advanced Technologies Database with Aerospace
DatabaseTitleList Technology Research Database

DeliveryMethod fulltext_linktorsrc
Discipline Physics
EISSN 1551-7616
Editor Triyono, Djoko
Sugeng, Kiki A.
Mart, Terry
Editor_xml – sequence: 1
  givenname: Terry
  surname: Mart
  fullname: Mart, Terry
  organization: Universitas Indonesia
– sequence: 2
  givenname: Djoko
  surname: Triyono
  fullname: Triyono, Djoko
  organization: Universitas Indonesia
– sequence: 3
  givenname: Kiki A.
  surname: Sugeng
  fullname: Sugeng, Kiki A.
  organization: Universitas Indonesia
ExternalDocumentID acp
Genre Conference Proceeding
GroupedDBID -~X
23M
5GY
AAAAW
AABDS
AAEUA
AAPUP
AAYIH
ABJNI
ACBRY
ACZLF
ADCTM
AEJMO
AFATG
AFHCQ
AGKCL
AGLKD
AGMXG
AGTJO
AHSDT
AJJCW
ALEPV
ALMA_UNASSIGNED_HOLDINGS
ATXIE
AWQPM
BPZLN
F5P
FDOHQ
FFFMQ
HAM
M71
M73
RIP
RQS
SJN
~02
8FD
ABJGX
ADMLS
H8D
L7M
ID FETCH-LOGICAL-p288t-35581996b81769f51af3b17cc085e94b365ff1a320557d4ec25c2ed16eaa24113
ISSN 0094-243X
IngestDate Mon Jun 30 07:04:25 EDT 2025
Sun Jul 14 10:05:17 EDT 2019
Fri Jun 21 00:14:38 EDT 2024
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License 0094-243X/2017/1862/030122/7/$30.00
Published by AIP Publishing.
LinkModel OpenURL
MeetingName INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2016 (ISCPMS 2016): Proceedings of the 2nd International Symposium on Current Progress in Mathematics and Sciences 2016
MergedId FETCHMERGED-LOGICAL-p288t-35581996b81769f51af3b17cc085e94b365ff1a320557d4ec25c2ed16eaa24113
Notes ObjectType-Conference Proceeding-1
SourceType-Conference Papers & Proceedings-1
content type line 21
OpenAccessLink https://aip.scitation.org/doi/pdf/10.1063/1.4991226
PQID 2116094707
PQPubID 2050672
PageCount 7
ParticipantIDs scitation_primary_10_1063_1_4991226
proquest_journals_2116094707
PublicationCentury 2000
PublicationDate 2017-07-10
PublicationDateYYYYMMDD 2017-07-10
PublicationDate_xml – month: 07
  year: 2017
  text: 2017-07-10
  day: 10
PublicationDecade 2010
PublicationPlace Melville
PublicationPlace_xml – name: Melville
PublicationTitle AIP conference proceedings
PublicationYear 2017
Publisher American Institute of Physics
Publisher_xml – name: American Institute of Physics
References Watson, Crick (c2) 1953
Cohen (c6) 2004
Needleman, Chrisrtian (c8) 1969
Brodzik (c5) 2007
Heyer (c7) 2008
Shu, Ouw (c4) 2004
References_xml – start-page: 122
  year: 2004
  ident: c6
  publication-title: ACM Comput. Surv.
– start-page: 737
  year: 1953
  ident: c2
  publication-title: Nature
– start-page: 101
  year: 2008
  ident: c7
  publication-title: PRIMUS
– start-page: 1423
  year: 2004
  ident: c4
  publication-title: Bull. Math. Biol.
– start-page: 443
  year: 1969
  ident: c8
  publication-title: J. Mol. Biol.
– start-page: 694
  year: 2007
  ident: c5
  publication-title: Bioinformatics
SSID ssj0029778
Score 2.1070607
Snippet DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including...
SourceID proquest
scitation
SourceType Aggregation Database
Enrichment Source
Publisher
SubjectTerms Alignment
Codes
Deoxyribonucleic acid
DNA
Gene sequencing
Medical research
Nucleotides
Permutations
Quaternions
Representations
Streptococcus infections
Thymine
Title Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA
URI http://dx.doi.org/10.1063/1.4991226
https://www.proquest.com/docview/2116094707
Volume 1862
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1db9MwFLVKJwRvfAyxMZAleKsysOMkzWPFNgbqyhCt1LfITuwpD0ujNgGJP8IP4I9yXTt2yiokkKqoSq6cyPfEPr4-9wahN6JgkSoiFhDOacBSzgIuUhVwThQhMkl5opOTr2bx5YJ9WkbLweBXT7XUNuI0_7E3r-R_vArnwK86S_YfPOsahRPwH_wLR_AwHO_6eO9UM_l4rYXjXbFYb7PpY2Hid6k1OfzSch0GtCrH0oUVNAc1SZbbbXdbK6QTW4_gwo3RDmx07cuNFWCO-K0ob1otpXWmRl-uW9e73nWzgmE3B4O6ki30Tsnl6GzmsDOFmYnbjHcXHtCJXbcGrc5wBs8JSCh3xN2TdcFhvP5uW_jQj2OQbYDUKlp96gCvdkUSWxls3g9bwnI0oGz7DWGYv-ygHZEgiU3Oph_VY_onfu9MF8DPdOTiFJZ9hNI9Jblnn7OLxXSazc-X83vogIZAqIboYHJ2Nf3q1vVAoc2Ebx-tK2AVh29d0zsLmAfAbozQosdl5o_Qoc_yxNcOMI_RQFZP0H3bF0_Rzx5q8Ephjxpcwq9DDQbUYIsabWZQgzsoYIcabFCDATWYV9ihxpsa1OjWd1CDPWowoOYQLS7O5-8vA_sZj6Cm43ET6AL-WusuxiSJUxURrkJBkjwHti9TJsI4UorwkOpycAWTOY1yKgsSSxg-GCHhMzSsVpV8jnAqY1XAmkTwXAETDYVIRRKxlIRAsoDcHqGTrpcz-55uMkpIDK5J3iVH6LXr-aw21VyyrQojDjOSWVfttfq2WnuLrC7U8d9v9QI99CA_QcNm3cqXQF8b8crC5zeQ-6Zm
linkProvider EBSCOhost
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=AIP+conference+proceedings&rft.atitle=Application+of+Quaternion+in+improving+the+quality+of+global+sequence+alignment+scores+for+an+ambiguous+sequence+target+in+Streptococcus+pneumoniae+DNA&rft.au=Lestari%2C+D&rft.au=Bustamam%2C+A&rft.au=Novianti%2C+T&rft.au=Ardaneswari%2C+G&rft.date=2017-07-10&rft.pub=American+Institute+of+Physics&rft.issn=0094-243X&rft.eissn=1551-7616&rft.volume=1862&rft.issue=1&rft_id=info:doi/10.1063%2F1.4991226&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0094-243X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0094-243X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0094-243X&client=summon