A Grammatical Swarm for protein classification

We present a grammatical swarm (GS) for the optimization of an aggregation operator. This combines the results of several classifiers into a unique score, producing an optimal ranking of the individuals. We apply our method to the identification of new members of a protein family. Support vector mac...

Full description

Saved in:
Bibliographic Details
Published in2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence) pp. 2561 - 2568
Main Authors Ramstein, G., Beaume, N., Jacques, Y.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.06.2008
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We present a grammatical swarm (GS) for the optimization of an aggregation operator. This combines the results of several classifiers into a unique score, producing an optimal ranking of the individuals. We apply our method to the identification of new members of a protein family. Support vector machine and naive Bayes classifiers exploit complementary features to compute probability estimates. A great advantage of the GS is that it produces an understandable algorithm revealing the interest of the classifiers. Due to the large volume of candidate sequences, ranking quality is of crucial importance. Consequently, our fitness criterion is based on the area under the ROC curve rather than on classification error rate. We discuss the performances obtained for a particular family, the cytokines and show that this technique is an efficient means of ranking the protein sequences.
ISBN:1424418224
9781424418220
ISSN:1089-778X
1941-0026
DOI:10.1109/CEC.2008.4631142