A bioinformatic pipeline for identifying informative SNP panels for parentage assignment from RAD seq data

Abstract The development of high‐throughput sequencing technologies is dramatically increasing the use of single nucleotide polymorphisms ( SNP s) across the field of genetics, but most parentage studies of wild populations still rely on microsatellites. We developed a bioinformatic pipeline for ide...

Full description

Saved in:
Bibliographic Details
Published inMolecular ecology resources Vol. 18; no. 6; pp. 1263 - 1281
Main Authors Andrews, Kimberly R., Adams, Jennifer R., Cassirer, E. Frances, Plowright, Raina K., Gardner, Colby, Dwire, Maggie, Hohenlohe, Paul A., Waits, Lisette P.
Format Journal Article
LanguageEnglish
Published 01.11.2018
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract The development of high‐throughput sequencing technologies is dramatically increasing the use of single nucleotide polymorphisms ( SNP s) across the field of genetics, but most parentage studies of wild populations still rely on microsatellites. We developed a bioinformatic pipeline for identifying SNP panels that are informative for parentage analysis from restriction site‐associated DNA sequencing ( RAD seq) data. This pipeline includes options for analysis with or without a reference genome, and provides methods to maximize genotyping accuracy and select sets of unlinked loci that have high statistical power. We test this pipeline on small populations of Mexican gray wolf and bighorn sheep, for which parentage analyses are expected to be challenging due to low genetic diversity and the presence of many closely related individuals. We compare the results of parentage analysis across SNP panels generated with or without the use of a reference genome, and between SNP s and microsatellites. For Mexican gray wolf, we conducted parentage analyses for 30 pups from a single cohort where samples were available from 64% of possible mothers and 53% of possible fathers, and the accuracy of parentage assignments could be estimated because true identities of parents were known a priori based on field data. For bighorn sheep, we conducted maternity analyses for 39 lambs from five cohorts where 77% of possible mothers were sampled, but true identities of parents were unknown. Analyses with and without a reference genome produced SNP panels with ≥95% parentage assignment accuracy for Mexican gray wolf, outperforming microsatellites at 78% accuracy. Maternity assignments were completely consistent across all SNP panels for the bighorn sheep, and were 74.4% consistent with assignments from microsatellites. Accuracy and consistency of parentage analysis were not reduced when using as few as 284 SNP s for Mexican gray wolf and 142 SNP s for bighorn sheep, indicating our pipeline can be used to develop SNP genotyping assays for parentage analysis with relatively small numbers of loci.
ISSN:1755-098X
1755-0998
DOI:10.1111/1755-0998.12910