On the Limit Imbalanced Logistic Regression by Binary Predictors
In this work, we introduce a modified (rescaled) likelihood for imbalanced logistic regression. This new approach makes easier the use of exponential priors and the computation of lasso regularization path. Precisely, we study a limiting behavior for which class imbalance is artificially increased b...
Saved in:
Main Author | |
---|---|
Format | Journal Article |
Language | English |
Published |
27.03.2017
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | In this work, we introduce a modified (rescaled) likelihood for imbalanced
logistic regression. This new approach makes easier the use of exponential
priors and the computation of lasso regularization path. Precisely, we study a
limiting behavior for which class imbalance is artificially increased by
replication of the majority class observations. If some strong overlap
conditions are satisfied, the maximum likelihood estimate converges towards a
finite value close to the initial one (intercept excluded) as shown by
simulations with binary predictors. This solution corresponds to the extremum
of a concave function that we refer to as "rescaled" likelihood. In this
context, the use of exponential priors has a clear interpretation as a shift on
the predictor means for the minority class. Thanks to the simple binary
structure, some random designs give analytic path estimators for the lasso
regularization problem. An effective approximate path algorithm by piecewise
logarithmic functions based on matrix inversions is also presented. This work
was motivated by its potential application to spontaneous reports databases in
a pharmacovigilance context. |
---|---|
DOI: | 10.48550/arxiv.1703.08995 |