Representing bacteria with unique genomic signatures

Classifying or identifying bacteria in metagenomic samples is an important problem in the analysis of metagenomic data. This task can be computationally expensive since microbial communities usually consist of hundreds to thousands of environmental microbial species. We proposed a new method for rep...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in big data Vol. 5; p. 1018356
Main Authors Pham, Diem-Trang, Phan, Vinhthuy
Format Journal Article
LanguageEnglish
Published Frontiers Media S.A 16.11.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Classifying or identifying bacteria in metagenomic samples is an important problem in the analysis of metagenomic data. This task can be computationally expensive since microbial communities usually consist of hundreds to thousands of environmental microbial species. We proposed a new method for representing bacteria in a microbial community using genomic signatures of those bacteria. With respect to the microbial community, the genomic signatures of each bacterium are unique to that bacterium; they do not exist in other bacteria in the community. Further, since the genomic signatures of a bacterium are much smaller than its genome size, the approach allows for a compressed representation of the microbial community. This approach uses a modified Bloom filter to store short k-mers with hash values that are unique to each bacterium. We show that most bacteria in many microbiomes can be represented uniquely using the proposed genomic signatures. This approach paves the way toward new methods for classifying bacteria in metagenomic samples.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
This article was submitted to Medicine and Public Health, a section of the journal Frontiers in Big Data
Reviewed by: Mai Oudah, New York University, United States; C. Titus Brown, University of California, Davis, United States
Edited by: Prashanti Manda, University of North Carolina at Greensboro, United States
ISSN:2624-909X
2624-909X
DOI:10.3389/fdata.2022.1018356