Phylogenetic diversity of the enteric pathogen Salmonella enterica subsp. enterica inferred from genome-wide reference-free SNP characters

The enteric pathogen Salmonella enterica is one of the leading causes of foodborne illness in the world. The species is extremely diverse, containing more than 2,500 named serovars that are designated for their unique antigen characters and pathogenicity profiles-some are known to be virulent pathog...

Full description

Saved in:
Bibliographic Details
Published inGenome biology and evolution Vol. 5; no. 11; pp. 2109 - 2123
Main Authors Timme, Ruth E, Pettengill, James B, Allard, Marc W, Strain, Errol, Barrangou, Rodolphe, Wehnes, Chris, Van Kessel, Joann S, Karns, Jeffrey S, Musser, Steven M, Brown, Eric W
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.01.2013
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The enteric pathogen Salmonella enterica is one of the leading causes of foodborne illness in the world. The species is extremely diverse, containing more than 2,500 named serovars that are designated for their unique antigen characters and pathogenicity profiles-some are known to be virulent pathogens, while others are not. Questions regarding the evolution of pathogenicity, significance of antigen characters, diversity of clustered regularly interspaced short palindromic repeat (CRISPR) loci, among others, will remain elusive until a strong evolutionary framework is established. We present the first large-scale S. enterica subsp. enterica phylogeny inferred from a new reference-free k-mer approach of gathering single nucleotide polymorphisms (SNPs) from whole genomes. The phylogeny of 156 isolates representing 78 serovars (102 were newly sequenced) reveals two major lineages, each with many strongly supported sublineages. One of these lineages is the S. Typhi group; well nested within the phylogeny. Lineage-through-time analyses suggest there have been two instances of accelerated rates of diversification within the subspecies. We also found that antigen characters and CRISPR loci reveal different evolutionary patterns than that of the phylogeny, suggesting that a horizontal gene transfer or possibly a shared environmental acquisition might have influenced the present character distribution. Our study also shows the ability to extract reference-free SNPs from a large set of genomes and then to use these SNPs for phylogenetic reconstruction. This automated, annotation-free approach is an important step forward for bacterial disease tracking and in efficiently elucidating the evolutionary history of highly clonal organisms.
Bibliography:Associate editor: Emmanuelle Lerat
Data Deposition: GenBank accession numbers for the 156 genomes used in this analysis are listed in Supplementary Table S1, Supplementary Material online. The 102 newly sequenced genomes are listed in the Data Deposition section of the Materials and Methods. The SNP matrix and phylogenetic trees are available at TreeBase.org, study number S14912. Individual SNPs were deposited at dbSNP database at NCBI under the accession range ss749616252–ss749736198.
ISSN:1759-6653
1759-6653
DOI:10.1093/gbe/evt159