Ensemble Learning-Based Feature Selection for Phage Protein Prediction

Phage has high specificity for its host recognition. As a natural enemy of bacteria, it has been used to treat super bacteria many times. Identifying phage proteins from the original sequence is very important for understanding the relationship between phage and host bacteria and developing new anti...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in microbiology Vol. 13; p. 932661
Main Authors Liu, Songbo, Cui, Chengmin, Chen, Huipeng, Liu, Tong
Format Journal Article
LanguageEnglish
Published Frontiers Media S.A 15.07.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Phage has high specificity for its host recognition. As a natural enemy of bacteria, it has been used to treat super bacteria many times. Identifying phage proteins from the original sequence is very important for understanding the relationship between phage and host bacteria and developing new antimicrobial agents. However, traditional experimental methods are both expensive and time-consuming. In this study, an ensemble learning-based feature selection method is proposed to find important features for phage protein identification. The method uses four types of protein sequence-derived features, quantifies the importance of each feature by adding perturbations to the features to influence the results, and finally splices the important features among the four types of features. In addition, we analyzed the selected features and their biological significance.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Reviewed by: Zhiwei Ji, Nanjing Agricultural University, China; Yi Xiong, Shanghai Jiao Tong University, China
Edited by: Jian Huang, University of Electronic Science and Technology of China, China
This article was submitted to Phage Biology, a section of the journal Frontiers in Microbiology
ISSN:1664-302X
1664-302X
DOI:10.3389/fmicb.2022.932661