Wrapper- and Ensemble-Based Feature Subset Selection Methods for Biomarker Discovery in Targeted Metabolomics

The discovery of markers allowing for accurate classification of metabolically very similar proband groups constitutes a challenging problem. We apply several search heuristics combined with different classifier types to targeted metabolomics data to identify compound subsets that classify plasma sa...

Full description

Saved in:

Bibliographic Details
Published in	Pattern Recognition in Bioinformatics pp. 121 - 132
Main Authors	Franken, Holger, Lehmann, Rainer, Häring, Hans-Ulrich, Fritsche, Andreas, Stefan, Norbert, Zell, Andreas
Format	Book Chapter
Language	English
Published	Berlin, Heidelberg Springer Berlin Heidelberg 2011
Series	Lecture Notes in Computer Science
Subjects	Average AUCs Feature Subset Feature Subset Selection Hill Climbing Multi Objective Genetic Algorithm
Online Access	Get full text
ISBN	3642248543 9783642248542
ISSN	0302-9743 1611-3349
DOI	10.1007/978-3-642-24855-9_11

Cover

More Information
Summary:	The discovery of markers allowing for accurate classification of metabolically very similar proband groups constitutes a challenging problem. We apply several search heuristics combined with different classifier types to targeted metabolomics data to identify compound subsets that classify plasma samples of insulin sensitive and -resistant subjects, both suffering from non-alcoholic fatty liver disease. Additionally, we integrate these methods into an ensemble and screen selected subsets for common features. We investigate, which methods appear the most suitable for the task, and test feature subsets for robustness and reproducibility. Furthermore, we consider the predictive potential of different compound classes. We find that classifiers fail in discriminating the non-selected data accurately, but benefit considerably from feature subset selection. Especially, a Pareto-based multi-objective genetic algorithm detects highly discriminative subsets and outperforms widely used heuristics. When transferred to new data, feature sets assembled by the ensemble approach show greater robustness than those selected by single methods.
ISBN:	3642248543 9783642248542
ISSN:	0302-9743 1611-3349
DOI:	10.1007/978-3-642-24855-9_11