Mutually Uncorrelated Primers for DNA-Based Data Storage
We introduce the notion of weakly mutually uncorrelated (WMU) sequences, motivated by applications in DNA-based data storage systems and synchronization between communication devices. WMU sequences are characterized by the property that no sufficiently long suffix of one sequence is the prefix of th...
Saved in:
Published in | IEEE transactions on information theory Vol. 64; no. 9; pp. 6283 - 6296 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
New York
IEEE
01.09.2018
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
ISSN | 0018-9448 1557-9654 |
DOI | 10.1109/TIT.2018.2792488 |
Cover
Abstract | We introduce the notion of weakly mutually uncorrelated (WMU) sequences, motivated by applications in DNA-based data storage systems and synchronization between communication devices. WMU sequences are characterized by the property that no sufficiently long suffix of one sequence is the prefix of the same or another sequence. WMU sequences used for primer design in DNA-based data storage systems are also required to be at large mutual Hamming distance from each other, have balanced compositions of symbols, and avoid primer-dimer byproducts. We derive bounds on the size of WMU and various constrained WMU codes and present a number of constructions for balanced, error-correcting, primer-dimer free WMU codes using Dyck paths, prefix-synchronized, and cyclic codes. |
---|---|
AbstractList | We introduce the notion of weakly mutually uncorrelated (WMU) sequences, motivated by applications in DNA-based data storage systems and synchronization between communication devices. WMU sequences are characterized by the property that no sufficiently long suffix of one sequence is the prefix of the same or another sequence. WMU sequences used for primer design in DNA-based data storage systems are also required to be at large mutual Hamming distance from each other, have balanced compositions of symbols, and avoid primer-dimer byproducts. We derive bounds on the size of WMU and various constrained WMU codes and present a number of constructions for balanced, error-correcting, primer-dimer free WMU codes using Dyck paths, prefix-synchronized, and cyclic codes. |
Author | Tabatabaei Yazdi, S. M. H. Gabrys, Ryan Kiah, Han Mao Milenkovic, Olgica |
Author_xml | – sequence: 1 givenname: S. M. H. surname: Tabatabaei Yazdi fullname: Tabatabaei Yazdi, S. M. H. organization: Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA – sequence: 2 givenname: Han Mao surname: Kiah fullname: Kiah, Han Mao organization: Sch. of Phys. & Math. Sci., Nanyang Technol. Univ., Singapore, Singapore – sequence: 3 givenname: Ryan surname: Gabrys fullname: Gabrys, Ryan email: ryan.gabrys@gmail.com organization: Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA – sequence: 4 givenname: Olgica surname: Milenkovic fullname: Milenkovic, Olgica organization: Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA |
BookMark | eNp9kLtPwzAQxi1UJNrCjsQSiTnFduLXWFoelcpDop0t17mgVCEutjP0v8dVKwYGptPdfd99ut8IDTrXAULXBE8IweputVhNKCZyQoWipZRnaEgYE7nirBygIU6rXJWlvECjELapLRmhQyRf-tibtt1n684676E1Ears3Tdf4ENWO5_NX6f5vQlpOjfRZB_RefMJl-i8Nm2Aq1Mdo_Xjw2r2nC_fnhaz6TK3VJGYF0B4JSpRc2MoFHKz4XhTM2WVYGCZERZvgAMppVU1r2ipCkKAWia5ELKqizG6Pd7deffdQ4h663rfpUhNscKSFKIkSYWPKutdCB5qvUsfGL_XBOsDH5346AMffeKTLPyPxTbRxMZ10Zum_c94czQ2APCbIyljnKviBzXmczA |
CODEN | IETTAW |
CitedBy_id | crossref_primary_10_1007_s10623_024_01445_3 crossref_primary_10_1109_ACCESS_2020_2970838 crossref_primary_10_1109_TIT_2020_2977915 crossref_primary_10_1109_TIT_2022_3204025 crossref_primary_10_1016_j_isci_2023_106231 crossref_primary_10_1109_TMBMC_2024_3396404 crossref_primary_10_1002_cplu_202200183 crossref_primary_10_1109_TIT_2020_2996377 crossref_primary_10_18231_j_ijcaap_2021_023 crossref_primary_10_1109_TIT_2020_3035032 crossref_primary_10_1109_TIT_2023_3296963 crossref_primary_10_1109_TIT_2019_2935973 crossref_primary_10_1007_s10623_024_01377_y crossref_primary_10_3390_e24081151 crossref_primary_10_1049_iet_com_2018_6053 crossref_primary_10_1007_s12095_024_00754_7 crossref_primary_10_1109_TNB_2021_3056351 crossref_primary_10_3390_ijms21062191 crossref_primary_10_1109_TIT_2023_3304712 crossref_primary_10_1109_JSAIT_2023_3294423 crossref_primary_10_1109_TCBB_2021_3127271 crossref_primary_10_1109_ACCESS_2023_3332254 crossref_primary_10_1109_TIT_2024_3453935 crossref_primary_10_1109_ACCESS_2020_2995812 crossref_primary_10_1007_s00453_025_01295_y crossref_primary_10_1007_s40314_020_1120_1 crossref_primary_10_1109_TMBMC_2024_3403488 crossref_primary_10_1016_j_celrep_2024_113699 crossref_primary_10_1016_j_disc_2024_114236 crossref_primary_10_1137_19M1253472 crossref_primary_10_1109_TCOMM_2024_3367748 crossref_primary_10_1109_TCBB_2020_3011582 crossref_primary_10_1137_19M1241106 crossref_primary_10_1109_TIT_2021_3119584 crossref_primary_10_1016_j_tcs_2023_113925 crossref_primary_10_1016_j_compbiomed_2023_107439 crossref_primary_10_1038_s41467_022_30140_x crossref_primary_10_1109_TIT_2023_3319010 crossref_primary_10_1007_s10623_023_01344_z crossref_primary_10_29252_jsdp_17_2_112 crossref_primary_10_1038_s41540_022_00233_w crossref_primary_10_1186_s12859_024_05943_y crossref_primary_10_1007_s40314_024_03055_0 |
Cites_doi | 10.1109/ISIT.2017.8007103 10.1109/TIT.2016.2555321 10.1109/TIT.2012.2189479 10.1109/TIT.2015.2456634 10.1109/TIT.2013.2252952 10.1109/ICC.2004.1312542 10.1038/nature11875 10.1038/s41598-017-05188-1 10.1002/j.1538-7305.1952.tb01393.x 10.2144/04372ST03 10.1093/nar/gkj454 10.1109/26.891223 10.1016/B978-1-4832-3187-7.50007-6 10.1109/TIT.1973.1055064 10.1109/TIT.2017.2700847 10.1126/science.1226355 10.1109/18.135636 10.1137/0135034 10.1109/ISIT.2005.1523340 10.1109/TCOM.1972.1091127 10.1109/TIT.1960.1057587 10.1109/TMBMC.2016.2537305 10.1007/11779360_9 10.1109/ISIT.2016.7541778 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018 |
Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018 |
DBID | 97E RIA RIE AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D |
DOI | 10.1109/TIT.2018.2792488 |
DatabaseName | IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Xplore CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Technology Research Database |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Computer Science |
EISSN | 1557-9654 |
EndPage | 6296 |
ExternalDocumentID | 10_1109_TIT_2018_2792488 8255669 |
Genre | orig-research |
GrantInformation_xml | – fundername: Singapore Ministry of Education grantid: MOE2015-T2-2-086; MOE2016-T1-001-156 – fundername: National Science Foundation grantid: CCF 16-18366 funderid: 10.13039/100000001 |
GroupedDBID | -~X .DC 0R~ 29I 3EH 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABFSI ABQJQ ABVLG ACGFO ACGFS ACGOD ACIWK AENEX AETEA AETIX AGQYO AGSQL AHBIQ AI. AIBXA AKJIK AKQYR ALLEH ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 E.L EBS EJD F5P HZ~ H~9 IAAWW IBMZZ ICLAB IDIHD IFIPE IFJZH IPLJI JAVBF LAI M43 MS~ O9- OCL P2P PQQKQ RIA RIE RNS RXW TAE TN5 VH1 VJK AAYOK AAYXX CITATION RIG 7SC 7SP 8FD JQ2 L7M L~C L~D |
ID | FETCH-LOGICAL-c291t-3e16d7d7f6aa2e38bb60bf59c975ec5a7c0be6e148c9f6d249311e2c586778df3 |
IEDL.DBID | RIE |
ISSN | 0018-9448 |
IngestDate | Wed Sep 03 09:37:02 EDT 2025 Tue Jul 01 02:16:10 EDT 2025 Thu Apr 24 23:06:25 EDT 2025 Wed Aug 27 06:00:50 EDT 2025 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 9 |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c291t-3e16d7d7f6aa2e38bb60bf59c975ec5a7c0be6e148c9f6d249311e2c586778df3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ORCID | 0000-0002-9197-3371 0000-0001-5611-0848 |
PQID | 2090813741 |
PQPubID | 36024 |
PageCount | 14 |
ParticipantIDs | crossref_primary_10_1109_TIT_2018_2792488 proquest_journals_2090813741 ieee_primary_8255669 crossref_citationtrail_10_1109_TIT_2018_2792488 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2018-09-01 |
PublicationDateYYYYMMDD | 2018-09-01 |
PublicationDate_xml | – month: 09 year: 2018 text: 2018-09-01 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | New York |
PublicationPlace_xml | – name: New York |
PublicationTitle | IEEE transactions on information theory |
PublicationTitleAbbrev | TIT |
PublicationYear | 2018 |
Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
References | ref13 ref12 ref15 ref14 ref10 ref2 ref17 ref16 tavares (ref28) 1968 ref18 levenshtein (ref1) 1964; 12 ref24 ref23 ref26 ref25 ref20 ref22 ref21 immink (ref19) 2004 varshamov (ref27) 1957; 117 a (ref29) 1992; 38 ref8 ref7 ref9 ref4 yazdi (ref11) 2015; 5 ref3 ref6 ref5 |
References_xml | – ident: ref20 doi: 10.1109/ISIT.2017.8007103 – ident: ref13 doi: 10.1109/TIT.2016.2555321 – volume: 5 year: 2015 ident: ref11 article-title: A rewritable, random-access DNA-based storage system publication-title: Sci Rep – volume: 117 start-page: 739 year: 1957 ident: ref27 article-title: Estimate of the number of signals in error correcting codes publication-title: Dokl Akad Nauk SSSR – ident: ref6 doi: 10.1109/TIT.2012.2189479 – ident: ref8 doi: 10.1109/TIT.2015.2456634 – ident: ref7 doi: 10.1109/TIT.2013.2252952 – ident: ref5 doi: 10.1109/ICC.2004.1312542 – ident: ref10 doi: 10.1038/nature11875 – year: 1968 ident: ref28 article-title: A study of synchronization techniques for binary cyclic codes – ident: ref12 doi: 10.1038/s41598-017-05188-1 – ident: ref26 doi: 10.1002/j.1538-7305.1952.tb01393.x – ident: ref18 doi: 10.2144/04372ST03 – ident: ref17 doi: 10.1093/nar/gkj454 – ident: ref4 doi: 10.1109/26.891223 – ident: ref25 doi: 10.1016/B978-1-4832-3187-7.50007-6 – ident: ref3 doi: 10.1109/TIT.1973.1055064 – ident: ref15 doi: 10.1109/TIT.2017.2700847 – ident: ref9 doi: 10.1126/science.1226355 – volume: 38 start-page: 940 year: 1992 ident: ref29 article-title: Constructions of binary constant-weight cyclic codes and cyclically permutable codes publication-title: IEEE Trans Inf Theory doi: 10.1109/18.135636 – ident: ref24 doi: 10.1137/0135034 – year: 2004 ident: ref19 publication-title: Codes for Mass Data Storage Systems – volume: 12 start-page: 125 year: 1964 ident: ref1 article-title: Decoding automata, invariant with respect to the initial state publication-title: Problemy Kibernet – ident: ref22 doi: 10.1109/ISIT.2005.1523340 – ident: ref2 doi: 10.1109/TCOM.1972.1091127 – ident: ref23 doi: 10.1109/TIT.1960.1057587 – ident: ref16 doi: 10.1109/TMBMC.2016.2537305 – ident: ref21 doi: 10.1007/11779360_9 – ident: ref14 doi: 10.1109/ISIT.2016.7541778 |
SSID | ssj0014512 |
Score | 2.525873 |
Snippet | We introduce the notion of weakly mutually uncorrelated (WMU) sequences, motivated by applications in DNA-based data storage systems and synchronization... |
SourceID | proquest crossref ieee |
SourceType | Aggregation Database Enrichment Source Index Database Publisher |
StartPage | 6283 |
SubjectTerms | bioinformatics Biological information theory Block codes channel coding constrained coding Data storage Data storage systems Deoxyribonucleic acid Dimers DNA DNA-based data storage systems Electronic devices Error correction Gene sequencing Hamming distance Memory Storage systems Synchronism Synchronization |
Title | Mutually Uncorrelated Primers for DNA-Based Data Storage |
URI | https://ieeexplore.ieee.org/document/8255669 https://www.proquest.com/docview/2090813741 |
Volume | 64 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED5BJxh4FUShoAwsSCR1Xk48FkoFSFRItFK3yI_LAmpRSQb49djOAwQIsWWwLesevrvc3XcAZ3kQRpKH1PVTmrtRElBX5Fy4nCgSqUhiKiza54TezKK7eTxfg4u2FwYRbfEZeubT5vLVUpbmV9kgNXhZlK3DuhazqlerzRhEsV8hg_tagXXM0aQkCRtMb6emhiv1DFheZGesfJogO1Plx0Nsrct4G-6be1VFJU9eWQhPvn-DbPzvxXdgq3YznWElF7uwhos92G5GODi1Ru_B5hc8wi6k96VpJ3l-c2YG3tL2uaByHswIgNWro_1bZzQZupfa8ilnxAvuPOqQXb9I-zAbX0-vbtx6tIIrA-YXbog-VYlKcsp5gGEqBCUij5lkSYwy5okkAinqWEmynCodo4W-j4GMDfxdqvLwADqL5QIPTW0U40gIpkQQbekkZ4EI_Vzq0CxWAac9GDTUzmSNO27GXzxnNv4gLNP8yQx_spo_PThvd7xUmBt_rO0acrfrakr3oN8wNKuV8lXvY9oBCrUPdfT7rmPYMGdXJWR96BSrEk-0z1GIUytsH49V0Ww |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT8MwDLZgHIAD4ykGA3rggkS39JU2x8FA47EJiU3arcrDvTBtCLoD_HqStB0IEOLWQ6xEdhzbtf0Z4DTzg1DygLpeQjM3jH3qiowLlxNFQhVKTIRF-xzQ3ii8HUfjJThf9MIgoi0-w5b5tLl8NZNz86usnRi8LMqWYUXb_TAqurUWOYMw8gpscE-rsI46qqQkYe3hzdBUcSUtA5cX2ikrn0bITlX58RRb-3Jdh351sqKs5Kk1z0VLvn8Dbfzv0Tdho3Q0nU5xM7ZgCafbUK-GODilTm_D-hdEwh1I-nPTUDJ5c0YG4NJ2uqByHswQgJdXR3u4TnfQcS-07VNOl-fcedRBu36TdmF0fTW87LnlcAVX-szL3QA9qmIVZ5RzH4NECEpEFjHJ4ghlxGNJBFLU0ZJkGVU6Sgs8D30ZGQC8RGXBHtSmsynum-ooxpEQTIgg2tZJznwReJnUwVmkfE4b0K64ncoSedwMwJikNgIhLNXySY180lI-DThbUDwXqBt_rN0x7F6sKzndgGYl0LRUy1dNx7QLFGgv6uB3qhNY7Q379-n9zeDuENbMPkVBWRNq-cscj7QHkotje_E-ALNC1Lk |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Mutually+Uncorrelated+Primers+for+DNA-Based+Data+Storage&rft.jtitle=IEEE+transactions+on+information+theory&rft.au=Tabatabaei+Yazdi%2C+S.+M.+H.&rft.au=Kiah%2C+Han+Mao&rft.au=Gabrys%2C+Ryan&rft.au=Milenkovic%2C+Olgica&rft.date=2018-09-01&rft.issn=0018-9448&rft.eissn=1557-9654&rft.volume=64&rft.issue=9&rft.spage=6283&rft.epage=6296&rft_id=info:doi/10.1109%2FTIT.2018.2792488&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TIT_2018_2792488 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9448&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9448&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9448&client=summon |