A Comparative Analysis of the Lyve-SET Phylogenomics Pipeline for Genomic Epidemiology of Foodborne Pathogens

Bibliographic Details
Published in Frontiers in Microbiology Vol. 8; p. 375
Main Authors Katz, Lee S., Griswold, Taylor, Williams-Newkirk, Amanda J., Wagner, Darlene, Petkau, Aaron, Sieffert, Cameron, Van Domselaar, Gary, Deng, Xiangyu, Carleton, Heather A.
Format Journal Article
Language English
Published Switzerland Frontiers Media SA 13.03.2017
Frontiers Media S.A
Summary: Modern epidemiology of foodborne bacterial pathogens in industrialized countries relies increasingly on whole genome sequencing (WGS) techniques. As opposed to profiling techniques such as pulsed-field gel electrophoresis, WGS requires a variety of computational methods. Since 2013, United States agencies responsible for food safety, including the CDC, FDA, and USDA, have been performing WGS on all Listeria monocytogenes found in clinical, food, and environmental samples. Each year, more genomes of other foodborne pathogens such as Escherichia coli, Campylobacter jejuni, and Salmonella enterica are being sequenced. Comparing thousands of genomes across an entire species requires a fast method with coarse resolution; however, capturing the fine details of highly related isolates requires a computationally heavy and sophisticated algorithm. Most investigations employing WGS depend on being able to identify an outbreak clade whose inter-genomic distances are less than an empirically determined threshold. When a difference of a few single nucleotide polymorphisms (SNPs) can help distinguish between genomes that are likely outbreak-associated and those that are less likely to be associated, we require a fine-resolution method. To achieve this level of resolution, we have developed Lyve-SET, a high-quality SNP pipeline. We evaluated Lyve-SET by retrospectively investigating 12 outbreak data sets along with four other SNP pipelines that have been used in outbreak investigation or similar scenarios. To compare these pipelines, several distance and phylogeny-based comparison methods were applied, which collectively showed that multiple pipelines were able to identify most outbreak clusters and strains.
Currently in the US PulseNet system, whole genome multi-locus sequence typing (wgMLST) is the preferred primary method for foodborne WGS cluster detection and outbreak investigation due to its ability to name standardized genomic profiles, its central database, and its ability to be run in a graphical user interface. However, creating a functional wgMLST scheme requires extended up-front development and subject-matter expertise. When a scheme does not exist or when the highest resolution is needed, SNP analysis is used. Using three outbreak data sets, we demonstrated the concordance between Lyve-SET SNP typing and wgMLST. Availability: Lyve-SET can be found at https://github.com/lskatz/Lyve-SET.
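The summary's idea of an outbreak clade whose inter-genomic distances fall below an empirically determined threshold can be illustrated with a toy sketch. This is not Lyve-SET's implementation: the function names, the toy alignment, and the threshold of 3 SNPs are all hypothetical, and single-linkage grouping is only one assumed way of turning pairwise distances into clusters.

```python
from itertools import combinations

# Sites carrying these symbols are skipped (an assumption of this sketch).
AMBIGUOUS = {"N", "-"}


def snp_distance(a, b):
    """Count unambiguous sites at which two aligned sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for x, y in zip(a.upper(), b.upper())
        if x != y and x not in AMBIGUOUS and y not in AMBIGUOUS
    )


def cluster_by_threshold(alignment, threshold):
    """Single-linkage grouping: link isolates whose distance is < threshold."""
    parent = {name: name for name in alignment}

    def find(name):
        # Union-find lookup with path halving.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(alignment, 2):
        if snp_distance(alignment[a], alignment[b]) < threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in alignment:
        clusters.setdefault(find(name), set()).add(name)
    return sorted(clusters.values(), key=lambda c: sorted(c))


# Hypothetical aligned SNP sites for four isolates.
toy_alignment = {
    "isolate_A": "ACGTACGTAC",
    "isolate_B": "ACGTACGTAT",
    "isolate_C": "ACGAACGTAT",
    "isolate_D": "TTGAACCTGG",
}

clusters = cluster_by_threshold(toy_alignment, threshold=3)
for members in clusters:
    print(sorted(members))
```

In this toy data, isolates A, B, and C differ from one another by at most 2 sites and group together, while isolate D differs from each of them by at least 5 sites and stays separate. A real threshold would be set empirically for each organism and pipeline, as the summary notes.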
Edited by: Sandra Torriani, University of Verona, Italy
Reviewed by: Jason Sahl, Northern Arizona University, USA; Young Min Kwon, University of Arkansas, USA
This article was submitted to Food Microbiology, a section of the journal Frontiers in Microbiology
ISSN: 1664-302X
DOI: 10.3389/fmicb.2017.00375