Ensemble Learning-Based Feature Selection for Phage Protein Prediction
Phage has high specificity for its host recognition. As a natural enemy of bacteria, it has been used to treat super bacteria many times. Identifying phage proteins from the original sequence is very important for understanding the relationship between phage and host bacteria and developing new anti...
Saved in:
Published in | Frontiers in microbiology Vol. 13; p. 932661 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Frontiers Media S.A
15.07.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Phage has high specificity for its host recognition. As a natural enemy of bacteria, it has been used to treat super bacteria many times. Identifying phage proteins from the original sequence is very important for understanding the relationship between phage and host bacteria and developing new antimicrobial agents. However, traditional experimental methods are both expensive and time-consuming. In this study, an ensemble learning-based feature selection method is proposed to find important features for phage protein identification. The method uses four types of protein sequence-derived features, quantifies the importance of each feature by adding perturbations to the features to influence the results, and finally splices the important features among the four types of features. In addition, we analyzed the selected features and their biological significance. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Reviewed by: Zhiwei Ji, Nanjing Agricultural University, China; Yi Xiong, Shanghai Jiao Tong University, China Edited by: Jian Huang, University of Electronic Science and Technology of China, China This article was submitted to Phage Biology, a section of the journal Frontiers in Microbiology |
ISSN: | 1664-302X 1664-302X |
DOI: | 10.3389/fmicb.2022.932661 |