Pulling out the 1%: Whole-Genome Capture for the Targeted Enrichment of Ancient DNA Sequencing Libraries

Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a...

Full description

Saved in:
Bibliographic Details
Published inAmerican journal of human genetics Vol. 93; no. 5; pp. 852 - 864
Main Authors Carpenter, Meredith L., Buenrostro, Jason D., Valdiosera, Cristina, Schroeder, Hannes, Allentoft, Morten E., Sikora, Martin, Rasmussen, Morten, Gravel, Simon, Guillén, Sonia, Nekhrizov, Georgi, Leshtakov, Krasimir, Dimitrova, Diana, Theodossiev, Nikola, Pettener, Davide, Luiselli, Donata, Sandoval, Karla, Moreno-Estrada, Andrés, Li, Yingrui, Wang, Jun, Gilbert, M. Thomas P., Willerslev, Eske, Greenleaf, William J., Bustamante, Carlos D.
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 07.11.2013
Cell Press
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a capture-based method for enriching the endogenous component of aDNA sequencing libraries. By using biotinylated RNA baits transcribed from genomic DNA libraries, we are able to capture DNA fragments from across the human genome. We demonstrate this method on libraries created from four Iron Age and Bronze Age human teeth from Bulgaria, as well as bone samples from seven Peruvian mummies and a Bronze Age hair sample from Denmark. Prior to capture, shotgun sequencing of these libraries yielded an average of 1.2% of reads mapping to the human genome (including duplicates). After capture, this fraction increased substantially, with up to 59% of reads mapped to human and enrichment ranging from 6- to 159-fold. Furthermore, we maintained coverage of the majority of regions sequenced in the precapture library. Intersection with the 1000 Genomes Project reference panel yielded an average of 50,723 SNPs (range 3,062–147,243) for the postcapture libraries sequenced with 1 million reads, compared with 13,280 SNPs (range 217–73,266) for the precapture libraries, increasing resolution in population genetic analyses. Our whole-genome capture approach makes it less costly to sequence aDNA from specimens containing very low levels of endogenous DNA, enabling the analysis of larger numbers of samples.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
These authors contributed equally to this work
These authors contributed equally to this work and are co-senior authors
ISSN:0002-9297
1537-6605
DOI:10.1016/j.ajhg.2013.10.002