Sparse Group Penalties for bi‐level variable selection

Many data sets exhibit a natural group structure due to contextual similarities or high correlations of variables, such as lipid markers that are interrelated based on biochemical principles. Knowledge of such groupings can be used through bi‐level selection methods to identify relevant feature grou...

Full description

Saved in:
Bibliographic Details
Published inBiometrical journal Vol. 66; no. 4; pp. e2200334 - n/a
Main Authors Buch, Gregor, Schulz, Andreas, Schmidtmann, Irene, Strauch, Konstantin, Wild, Philipp S.
Format Journal Article
LanguageEnglish
Published Germany Wiley - VCH Verlag GmbH & Co. KGaA 01.06.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Many data sets exhibit a natural group structure due to contextual similarities or high correlations of variables, such as lipid markers that are interrelated based on biochemical principles. Knowledge of such groupings can be used through bi‐level selection methods to identify relevant feature groups and highlight their predictive members. One of the best known approaches of this kind combines the classical Least Absolute Shrinkage and Selection Operator (LASSO) with the Group LASSO, resulting in the Sparse Group LASSO. We propose the Sparse Group Penalty (SGP) framework, which allows for a flexible combination of different SGL‐style shrinkage conditions. Analogous to SGL, we investigated the combination of the Smoothly Clipped Absolute Deviation (SCAD), the Minimax Concave Penalty (MCP) and the Exponential Penalty (EP) with their group versions, resulting in the Sparse Group SCAD, the Sparse Group MCP, and the novel Sparse Group EP (SGE). Those shrinkage operators provide refined control of the effect of group formation on the selection process through a tuning parameter. In simulation studies, SGPs were compared with other bi‐level selection methods (Group Bridge, composite MCP, and Group Exponential LASSO) for variable and group selection evaluated with the Matthews correlation coefficient. We demonstrated the advantages of the new SGE in identifying parsimonious models, but also identified scenarios that highlight the limitations of the approach. The performance of the techniques was further investigated in a real‐world use case for the selection of regulated lipids in a randomized clinical trial.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0323-3847
1521-4036
1521-4036
DOI:10.1002/bimj.202200334