Mutually Uncorrelated Primers for DNA-Based Data Storage

Bibliographic Details
Published in IEEE Transactions on Information Theory, Vol. 64, no. 9, pp. 6283-6296
Main Authors Tabatabaei Yazdi, S. M. H.; Kiah, Han Mao; Gabrys, Ryan; Milenkovic, Olgica
Format Journal Article
Language English
Published New York, IEEE, 1 September 2018
Publisher The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
ISSN 0018-9448
EISSN 1557-9654
DOI 10.1109/TIT.2018.2792488

Abstract We introduce the notion of weakly mutually uncorrelated (WMU) sequences, motivated by applications in DNA-based data storage systems and synchronization between communication devices. WMU sequences are characterized by the property that no sufficiently long suffix of one sequence is the prefix of the same or another sequence. WMU sequences used for primer design in DNA-based data storage systems are also required to be at large mutual Hamming distance from each other, have balanced compositions of symbols, and avoid primer-dimer byproducts. We derive bounds on the size of WMU and various constrained WMU codes and present a number of constructions for balanced, error-correcting, primer-dimer free WMU codes using Dyck paths, prefix-synchronized, and cyclic codes.
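The defining property in the abstract can be checked mechanically. The following is a minimal sketch, not the authors' construction: the function names, the threshold parameter `kappa` (the minimum overlap length that counts as "sufficiently long"), and the reading of "balanced" as exactly 50% G/C content are assumptions made for illustration. Primer-dimer avoidance is not modeled.

```python
from itertools import combinations


def is_wmu(seqs, kappa):
    """Return True if no proper suffix of length >= kappa of any sequence
    equals a prefix of the same or another sequence.

    `kappa` is an illustrative name for the overlap threshold (assumption).
    """
    if kappa < 1:
        raise ValueError("kappa must be at least 1")
    for a in seqs:
        for b in seqs:
            # Only proper overlaps are tested, so a sequence matching
            # itself in full is not counted as a violation.
            longest = min(len(a), len(b)) - 1
            for length in range(kappa, longest + 1):
                if a[-length:] == b[:length]:
                    return False
    return True


def min_hamming_distance(seqs):
    """Smallest pairwise Hamming distance among equal-length sequences."""
    return min(
        sum(x != y for x, y in zip(a, b)) for a, b in combinations(seqs, 2)
    )


def is_balanced(seq):
    """Assumed meaning of 'balanced': exactly half the symbols are G or C."""
    gc = sum(base in "GC" for base in seq)
    return 2 * gc == len(seq)


primers = ["ACGA", "GACC"]
print(is_wmu(primers, 2))             # suffix "GA" of ACGA is a prefix of GACC
print(is_wmu(primers, 3))             # no overlap of length 3 exists
print(min_hamming_distance(primers))
```

With `kappa = 1` the check reduces to the classical mutually uncorrelated condition, in which no proper suffix of any length may match a prefix. Larger values of `kappa` relax the condition by tolerating short overlaps.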
Author affiliations
Tabatabaei Yazdi, S. M. H. (Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA)
Kiah, Han Mao (Sch. of Phys. & Math. Sci., Nanyang Technol. Univ., Singapore, Singapore)
Gabrys, Ryan (Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA; email: ryan.gabrys@gmail.com)
Milenkovic, Olgica (Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA)
CODEN IETTAW
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018
Discipline Engineering
Computer Science
Funding Singapore Ministry of Education (grants MOE2015-T2-2-086; MOE2016-T1-001-156)
National Science Foundation (grant CCF 16-18366; funder ID 10.13039/100000001)
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
https://doi.org/10.15223/policy-029
https://doi.org/10.15223/policy-037
ORCID 0000-0002-9197-3371
0000-0001-5611-0848
Subject Terms
bioinformatics
Biological information theory
Block codes
channel coding
constrained coding
Data storage
Data storage systems
Deoxyribonucleic acid
Dimers
DNA
DNA-based data storage systems
Electronic devices
Error correction
Gene sequencing
Hamming distance
Memory
Storage systems
Synchronism
Synchronization
URI https://ieeexplore.ieee.org/document/8255669
https://www.proquest.com/docview/2090813741