The Adaptive $\tau$-Lasso: Robustness and Oracle Properties
This paper introduces a new regularized version of the robust $\tau$-regression estimator for analyzing high-dimensional datasets subject to gross contamination in the response variables and covariates. The resulting estimator, termed adaptive $\tau$-Lasso, is robust to outliers and high-leverage po...
Saved in:
Main Authors | , |
---|---|
Format | Journal Article |
Language | English |
Published |
18.04.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | This paper introduces a new regularized version of the robust
$\tau$-regression estimator for analyzing high-dimensional datasets subject to
gross contamination in the response variables and covariates. The resulting
estimator, termed adaptive $\tau$-Lasso, is robust to outliers and
high-leverage points. It also incorporates an adaptive $\ell_1$-norm penalty
term, which enables the selection of relevant variables and reduces the bias
associated with large true regression coefficients. More specifically, this
adaptive $\ell_1$-norm penalty term assigns a weight to each regression
coefficient. For a fixed number of predictors $p$, we show that the adaptive
$\tau$-Lasso has the oracle property, ensuring both variable-selection
consistency and asymptotic normality. Asymptotic normality applies only to the
entries of the regression vector corresponding to the true support, assuming
knowledge of the true regression vector support. We characterize its robustness
by establishing the finite-sample breakdown point and the influence function.
We carry out extensive simulations and observe that the class of $\tau$-Lasso
estimators exhibits robustness and reliable performance in both contaminated
and uncontaminated data settings. We also validate our theoretical findings on
robustness properties through simulations. In the face of outliers and
high-leverage points, the adaptive $\tau$-Lasso and $\tau$-Lasso estimators
achieve the best performance or match the best performances of competing
regularized estimators, with minimal or no loss in terms of prediction and
variable selection accuracy for almost all scenarios considered in this study.
Therefore, the adaptive $\tau$-Lasso and $\tau$-Lasso estimators provide
attractive tools for a variety of sparse linear regression problems,
particularly in high-dimensional settings and when the data is contaminated by
outliers and high-leverage points. |
---|---|
DOI: | 10.48550/arxiv.2304.09310 |