Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform

Classifications Services AEM Citing Articles Google Scholar PubMed Related Content Social Bookmarking CiteULike Delicious Digg Facebook Google+ Mendeley Reddit StumbleUpon Twitter current issue Spotlights in the Current Issue AEM About AEM Subscribers Authors Reviewers Advertisers Inquiries from the...

Full description

Saved in:
Bibliographic Details
Published inApplied and Environmental Microbiology Vol. 79; no. 17; pp. 5112 - 5120
Main Authors Kozich, James J, Westcott, Sarah L, Baxter, Nielson T, Highlander, Sarah K, Schloss, Patrick D
Format Journal Article
LanguageEnglish
Published United States American Society for Microbiology 01.09.2013
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Classifications Services AEM Citing Articles Google Scholar PubMed Related Content Social Bookmarking CiteULike Delicious Digg Facebook Google+ Mendeley Reddit StumbleUpon Twitter current issue Spotlights in the Current Issue AEM About AEM Subscribers Authors Reviewers Advertisers Inquiries from the Press Permissions & Commercial Reprints ASM Journals Public Access Policy AEM RSS Feeds 1752 N Street N.W. • Washington DC 20036 202.737.3600 • 202.942.9355 fax • journals@asmusa.org Print ISSN: 0099-2240 Online ISSN: 1098-5336 Copyright © 2014 by the American Society for Microbiology.   For an alternate route to AEM .asm.org, visit: AEM       
AbstractList Rapid advances in sequencing technology have changed the experimental landscape of microbial ecology. In the last 10 years, the field has moved from sequencing hundreds of 16S rRNA gene fragments per study using clone libraries to the sequencing of millions of fragments per study using next-generation sequencing technologies from 454 and Illumina. As these technologies advance, it is critical to assess the strengths, weaknesses, and overall suitability of these platforms for the interrogation of microbial communities. Here, we present an improved method for sequencing variable regions within the 16S rRNA gene using Illumina's MiSeq platform, which is currently capable of producing paired 250-nucleotide reads. We evaluated three overlapping regions of the 16S rRNA gene that vary in length (i.e., V34, V4, and V45) by resequencing a mock community and natural samples from human feces, mouse feces, and soil. By titrating the concentration of 16S rRNA gene amplicons applied to the flow cell and using a quality score-based approach to correct discrepancies between reads used to construct contigs, we were able to reduce error rates by as much as two orders of magnitude. Finally, we reprocessed samples from a previous study to demonstrate that large numbers of samples could be multiplexed and sequenced in parallel with shotgun metagenomes. These analyses demonstrate that our approach can provide data that are at least as good as that generated by the 454 platform while providing considerably higher sequencing coverage for a fraction of the cost.
Rapid advances in sequencing technology have changed the experimental landscape of microbial ecology. In the last 10 years, the field has moved from sequencing hundreds of 16S rRNA gene fragments per study using clone libraries to the sequencing of millions of fragments per study using next-generation sequencing technologies from 454 and Illumina. As these technologies advance, it is critical to assess the strengths, weaknesses, and overall suitability of these platforms for the interrogation of microbial communities. Here, we present an improved method for sequencing variable regions within the 16S rRNA gene using Illumina's MiSeq platform, which is currently capable of producing paired 250-nucleotide reads. We evaluated three overlapping regions of the 16S rRNA gene that vary in length (i.e., V34, V4, and V45) by resequencing a mock community and natural samples from human feces, mouse feces, and soil. By titrating the concentration of 16S rRNA gene amplicons applied to the flow cell and using a quality score-based approach to correct discrepancies between reads used to construct contigs, we were able to reduce error rates by as much as two orders of magnitude. Finally, we reprocessed samples from a previous study to demonstrate that large numbers of samples could be multiplexed and sequenced in parallel with shotgun metagenomes. These analyses demonstrate that our approach can provide data that are at least as good as that generated by the 454 platform while providing considerably higher sequencing coverage for a fraction of the cost.Rapid advances in sequencing technology have changed the experimental landscape of microbial ecology. In the last 10 years, the field has moved from sequencing hundreds of 16S rRNA gene fragments per study using clone libraries to the sequencing of millions of fragments per study using next-generation sequencing technologies from 454 and Illumina. As these technologies advance, it is critical to assess the strengths, weaknesses, and overall suitability of these platforms for the interrogation of microbial communities. Here, we present an improved method for sequencing variable regions within the 16S rRNA gene using Illumina's MiSeq platform, which is currently capable of producing paired 250-nucleotide reads. We evaluated three overlapping regions of the 16S rRNA gene that vary in length (i.e., V34, V4, and V45) by resequencing a mock community and natural samples from human feces, mouse feces, and soil. By titrating the concentration of 16S rRNA gene amplicons applied to the flow cell and using a quality score-based approach to correct discrepancies between reads used to construct contigs, we were able to reduce error rates by as much as two orders of magnitude. Finally, we reprocessed samples from a previous study to demonstrate that large numbers of samples could be multiplexed and sequenced in parallel with shotgun metagenomes. These analyses demonstrate that our approach can provide data that are at least as good as that generated by the 454 platform while providing considerably higher sequencing coverage for a fraction of the cost.
Classifications Services AEM Citing Articles Google Scholar PubMed Related Content Social Bookmarking CiteULike Delicious Digg Facebook Google+ Mendeley Reddit StumbleUpon Twitter current issue Spotlights in the Current Issue AEM About AEM Subscribers Authors Reviewers Advertisers Inquiries from the Press Permissions & Commercial Reprints ASM Journals Public Access Policy AEM RSS Feeds 1752 N Street N.W. • Washington DC 20036 202.737.3600 • 202.942.9355 fax • journals@asmusa.org Print ISSN: 0099-2240 Online ISSN: 1098-5336 Copyright © 2014 by the American Society for Microbiology.   For an alternate route to AEM .asm.org, visit: AEM       
Author Nielson T. Baxter
Sarah L. Westcott
Patrick D. Schloss
Sarah K. Highlander
James J. Kozich
Author_xml – sequence: 1
  givenname: James J
  surname: Kozich
  fullname: Kozich, James J
  organization: Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, USA
– sequence: 2
  givenname: Sarah L
  surname: Westcott
  fullname: Westcott, Sarah L
– sequence: 3
  givenname: Nielson T
  surname: Baxter
  fullname: Baxter, Nielson T
– sequence: 4
  givenname: Sarah K
  surname: Highlander
  fullname: Highlander, Sarah K
– sequence: 5
  givenname: Patrick D
  surname: Schloss
  fullname: Schloss, Patrick D
BackLink https://www.ncbi.nlm.nih.gov/pubmed/23793624$$D View this record in MEDLINE/PubMed
BookMark eNqNkktv1DAQgC1URLeFG2dkceKSYsdxbF-QVruFrtSqlQpnazaZbIwcO-RRWH4HPxhX3aJy68nyzKfP8_AJOQoxICFvOTvjPNcfl-dXZ4yzQmRcvCALzozOpBDlEVkwZkyW5wU7Jifj-J0xVrBSvyLHuVBGlHmxIH_WeIc-9h2GicaGAl3P4LNNqPEXvcUfM4bKhR29nQaYcLenEGq6mtPFxUBvXI_eBaRNHOgygN__voeXXe9dlfIHAdI1TEBTYGqRXrkUphvv584FePrIjYcpmbrX5GUDfsQ3h_OUfPt8_nV1kV1ef9mslpdZWwg2ZbJqsDZayLqqGddoTLFlGoysSl0rYJiXYltVWvOa5wlQhotSIZONUo0sQZySTw_eft52WFdpBgN42w-ug2FvIzj7fya41u7inRVKCqNEEnw4CIaYuhgn27mxQu8hYJxHywtRMC6ZKp6BciOEkeW99d3Tsv7V87i1BLx_AFq3a3-6AS2MnQXsrDKWKyvTxxB_AagSqCs
ContentType Journal Article
Copyright Copyright © 2013, American Society for Microbiology. All Rights Reserved. 2013 American Society for Microbiology
Copyright_xml – notice: Copyright © 2013, American Society for Microbiology. All Rights Reserved. 2013 American Society for Microbiology
DBID CGR
CUY
CVF
ECM
EIF
NPM
7X8
7T7
8FD
C1K
FR3
P64
5PM
DOI 10.1128/AEM.01043-13
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
Industrial and Applied Microbiology Abstracts (Microbiology A)
Technology Research Database
Environmental Sciences and Pollution Management
Engineering Research Database
Biotechnology and BioEngineering Abstracts
PubMed Central (Full Participant titles)
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
Engineering Research Database
Technology Research Database
Industrial and Applied Microbiology Abstracts (Microbiology A)
Biotechnology and BioEngineering Abstracts
Environmental Sciences and Pollution Management
DatabaseTitleList MEDLINE

Engineering Research Database
MEDLINE - Academic

Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Economics
Engineering
Biology
Medicine
Pharmacy, Therapeutics, & Pharmacology
EISSN 1098-5336
1098-6596
EndPage 5120
ExternalDocumentID PMC3753973
23793624
aem_79_17_5112
Genre Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: NIGMS NIH HHS
  grantid: 5R01GM099514
– fundername: NIGMS NIH HHS
  grantid: R01 GM099514
– fundername: NHGRI NIH HHS
  grantid: R01 HG005975
– fundername: NIDDK NIH HHS
  grantid: P30DK034933
– fundername: NHGRI NIH HHS
  grantid: U54 HG004973
– fundername: NHGRI NIH HHS
  grantid: R01HG005975
– fundername: NIDDK NIH HHS
  grantid: P30 DK034933
– fundername: NHGRI NIH HHS
  grantid: U54HG004973
GroupedDBID ---
-~X
.55
.GJ
0R~
23M
2WC
39C
3O-
4.4
53G
5GY
5RE
5VS
6J9
85S
AAZTW
ABOGM
ABPPZ
ABTAH
ACBTR
ACGFO
ACIWK
ACNCT
ACPRK
ADBBV
ADUKH
AENEX
AFFNX
AFRAH
AGCDD
AGVNZ
AI.
ALMA_UNASSIGNED_HOLDINGS
AOIJS
BAWUL
BKOMP
BTFSW
C1A
CGR
CS3
CUY
CVF
D0L
DIK
E.-
E3Z
EBS
ECM
EIF
EJD
F5P
GX1
H13
HYE
HZ~
H~9
K-O
KQ8
L7B
MVM
NEJ
NPM
O9-
OHT
P2P
PQQKQ
RHI
RNS
RPM
RSF
RXW
TAE
TAF
TN5
TR2
TWZ
UHB
VH1
W8F
WH7
WHG
WOQ
X6Y
X7M
XJT
YV5
ZCG
ZGI
ZXP
ZY4
~02
~KM
7X8
AAGFI
7T7
8FD
C1K
FR3
FRP
HH5
LSO
OK1
P64
W2D
5PM
ID FETCH-LOGICAL-h430t-5cfed9835dcd018e994b08a95c68d7a0e263bcc881d12018791367e05f77f56a3
ISSN 0099-2240
1098-5336
IngestDate Thu Aug 21 14:32:04 EDT 2025
Fri Jul 11 02:32:09 EDT 2025
Thu Jul 10 18:04:55 EDT 2025
Thu Apr 03 06:59:48 EDT 2025
Wed May 18 15:27:21 EDT 2016
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 17
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-h430t-5cfed9835dcd018e994b08a95c68d7a0e263bcc881d12018791367e05f77f56a3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
OpenAccessLink https://aem.asm.org/content/aem/79/17/5112.full.pdf
PMID 23793624
PQID 1419339563
PQPubID 23479
PageCount 9
ParticipantIDs highwire_asm_aem_79_17_5112
pubmedcentral_primary_oai_pubmedcentral_nih_gov_3753973
proquest_miscellaneous_1434015074
proquest_miscellaneous_1419339563
pubmed_primary_23793624
PublicationCentury 2000
PublicationDate 2013-09-01
PublicationDateYYYYMMDD 2013-09-01
PublicationDate_xml – month: 09
  year: 2013
  text: 2013-09-01
  day: 01
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: 1752 N St., N.W., Washington, DC
PublicationTitle Applied and Environmental Microbiology
PublicationTitleAlternate Appl Environ Microbiol
PublicationYear 2013
Publisher American Society for Microbiology
Publisher_xml – name: American Society for Microbiology
References 16880384 - Proc Natl Acad Sci U S A. 2006 Aug 8;103(32):12115-20
23202435 - Nat Methods. 2013 Jan;10(1):57-9
22194782 - PLoS One. 2011;6(12):e27310
21276213 - BMC Bioinformatics. 2011;12:38
22870235 - PLoS One. 2012;7(8):e41606
20011594 - PLoS One. 2009;4(12):e8230
21421784 - Appl Environ Microbiol. 2011 May;77(10):3219-26
24450498 - Microbiome. 2013 Mar 04;1(1):10
21700674 - Bioinformatics. 2011 Aug 15;27(16):2194-200
18987734 - Nature. 2008 Nov 6;456(7218):53-9
17947321 - Nucleic Acids Res. 2007;35(21):7188-96
17586664 - Appl Environ Microbiol. 2007 Aug;73(16):5261-7
22688727 - Gut Microbes. 2012 Jul-Aug;3(4):383-93
20534432 - Proc Natl Acad Sci U S A. 2011 Mar 15;108 Suppl 1:4516-22
20628621 - PLoS Comput Biol. 2010;6(7):e1000844
19725865 - Environ Microbiol. 2010 Jan;12(1):118-23
22170427 - ISME J. 2012 Jul;6(7):1273-6
19251739 - Genome Res. 2009 Jun;19(6):1117-23
19801464 - Appl Environ Microbiol. 2009 Dec;75(23):7537-41
9440526 - J Bacteriol. 1998 Jan;180(2):366-76
22699609 - Nature. 2012 Jun 14;486(7402):207-14
21048977 - PLoS One. 2010;5(10):e15406
22402401 - ISME J. 2012 Aug;6(8):1621-4
21212162 - Genome Res. 2011 Mar;21(3):494-504
22333067 - BMC Bioinformatics. 2012;13:31
23018771 - ISME J. 2013 Mar;7(3):457-60
20236171 - Environ Microbiol. 2010 Jul;12(7):1889-98
References_xml – reference: 22402401 - ISME J. 2012 Aug;6(8):1621-4
– reference: 19251739 - Genome Res. 2009 Jun;19(6):1117-23
– reference: 20011594 - PLoS One. 2009;4(12):e8230
– reference: 20534432 - Proc Natl Acad Sci U S A. 2011 Mar 15;108 Suppl 1:4516-22
– reference: 20628621 - PLoS Comput Biol. 2010;6(7):e1000844
– reference: 23018771 - ISME J. 2013 Mar;7(3):457-60
– reference: 22333067 - BMC Bioinformatics. 2012;13:31
– reference: 21421784 - Appl Environ Microbiol. 2011 May;77(10):3219-26
– reference: 18987734 - Nature. 2008 Nov 6;456(7218):53-9
– reference: 20236171 - Environ Microbiol. 2010 Jul;12(7):1889-98
– reference: 21276213 - BMC Bioinformatics. 2011;12:38
– reference: 22870235 - PLoS One. 2012;7(8):e41606
– reference: 22194782 - PLoS One. 2011;6(12):e27310
– reference: 17947321 - Nucleic Acids Res. 2007;35(21):7188-96
– reference: 17586664 - Appl Environ Microbiol. 2007 Aug;73(16):5261-7
– reference: 21048977 - PLoS One. 2010;5(10):e15406
– reference: 22170427 - ISME J. 2012 Jul;6(7):1273-6
– reference: 22688727 - Gut Microbes. 2012 Jul-Aug;3(4):383-93
– reference: 19801464 - Appl Environ Microbiol. 2009 Dec;75(23):7537-41
– reference: 21212162 - Genome Res. 2011 Mar;21(3):494-504
– reference: 22699609 - Nature. 2012 Jun 14;486(7402):207-14
– reference: 23202435 - Nat Methods. 2013 Jan;10(1):57-9
– reference: 21700674 - Bioinformatics. 2011 Aug 15;27(16):2194-200
– reference: 19725865 - Environ Microbiol. 2010 Jan;12(1):118-23
– reference: 16880384 - Proc Natl Acad Sci U S A. 2006 Aug 8;103(32):12115-20
– reference: 24450498 - Microbiome. 2013 Mar 04;1(1):10
– reference: 9440526 - J Bacteriol. 1998 Jan;180(2):366-76
SSID ssj0004068
ssj0006590
Score 2.652144
Snippet Classifications Services AEM Citing Articles Google Scholar PubMed Related Content Social Bookmarking CiteULike Delicious Digg Facebook Google+ Mendeley Reddit...
Rapid advances in sequencing technology have changed the experimental landscape of microbial ecology. In the last 10 years, the field has moved from sequencing...
SourceID pubmedcentral
proquest
pubmed
highwire
SourceType Open Access Repository
Aggregation Database
Index Database
Publisher
StartPage 5112
SubjectTerms Animals
Biota
Computational Biology - methods
Data processing
Feces - microbiology
High-Throughput Nucleotide Sequencing - methods
High-Throughput Nucleotide Sequencing - standards
Humans
Metagenome
Methods
Mice
Soil Microbiology
Title Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform
URI http://aem.asm.org/content/79/17/5112.abstract
https://www.ncbi.nlm.nih.gov/pubmed/23793624
https://www.proquest.com/docview/1419339563
https://www.proquest.com/docview/1434015074
https://pubmed.ncbi.nlm.nih.gov/PMC3753973
Volume 79
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lj9MwELb2IQQcEBQWygIyErcoIY3z8nFZivZB0R660t4iJ3G1ldq0oqkE_R38YGbsOHGXagVcospJY1fzdTwznvmGkA9gc_MymfhuycIQQzeFm8NO54JlEfhFHiaFYuAbfYvPrsOLm-hmb__Qylpa17lXbHbWlfyPVGEM5IpVsv8g2falMACfQb5wBQnD9a9kbGX86DrHz2sxc8-RABGUgMqR1oTbig9CMy2drhuZX02XUtmYmGiouEk2KkSiUsyVFtFJ1gCMWjRnCs5oCsPOObZHnlbCnuRqJmo0gG1r15i4OO-wq6jDcuBpRwDVJvFgwq5z4TmXi820aMPUKmbtfPUcbIKDTBLdUYpEqkln7DmfxA8rzVh_49JTSSwzVb5jBzew0QTfShQxp1Z2CusfK2xUO-cu2id6Y9PaHMlSAXWxre517xoD68RS3mh77t5VAqyUOBmOPPRemaurZy2ALecKYQEDbRfrmvBtau87W26bCCnkPEt4NkgynHyfHAbg7QQm6GTKe_04NWSq-AtN_UaQfrTXhLzWzQIstutdntPdBGDLoho_JU8aV4ieaFw_I3uy6pEHujnqzx55aGrmVz3y2KLNfE5-WbiniwkVtMM97SBJDe4pQIAa3FODewpSpi3uqcG9eYGkiHsKA4B7qnBPDe7tSQzuX5DrL8Px6ZnbdBdxb0Pm125UTGTJwQEpi9IfpJLzMPdTwaMiTstE-DKIWV4UKTh0gwBbV3JkN5R-NEmSSRQLdkQOqkUlXxHKfZHnAzaRPitD4Qe8lCnGVkSZlKAJ_T45NuLIxGqebUu9T94bCWWg2_HATlRysV6BWw7uFeNRzO57hoUYtUzCPnmppZotNVFNZuDQJ8mWvNsHkFt--041vVUc8yyJwFNhr-9d-TF51P1p35CD-vtavgUbvc7fKQT_BlmC7I0
linkProvider National Library of Medicine
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Development+of+a+Dual-Index+Sequencing+Strategy+and+Curation+Pipeline+for+Analyzing+Amplicon+Sequence+Data+on+the+MiSeq+Illumina+Sequencing+Platform&rft.jtitle=Applied+and+Environmental+Microbiology&rft.au=James+J.+Kozich&rft.au=Sarah+L.+Westcott&rft.au=Nielson+T.+Baxter&rft.au=Sarah+K.+Highlander&rft.date=2013-09-01&rft.pub=American+Society+for+Microbiology&rft.issn=0099-2240&rft.eissn=1098-5336&rft.volume=79&rft.issue=17&rft.spage=5112&rft_id=info:doi/10.1128%2FAEM.01043-13&rft_id=info%3Apmid%2F23793624&rft.externalDBID=n%2Fa&rft.externalDocID=aem_79_17_5112
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0099-2240&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0099-2240&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0099-2240&client=summon