Machine learning methods for short-term probability of default: A comparison of classification, regression and ranking methods
Probability of default estimation via machine learning on historical data is widely studied in credit risk modeling. In this work, we investigated the use of machine learning for a finer-grained risk estimation task, namely spot factoring. Here, the goal is to estimate the likelihood that an invoice...
Saved in:
Published in | The Journal of the Operational Research Society Vol. 73; no. 1; pp. 191 - 206 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Taylor & Francis
02.01.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Probability of default estimation via machine learning on historical data is widely studied in credit risk modeling. In this work, we investigated the use of machine learning for a finer-grained risk estimation task, namely spot factoring. Here, the goal is to estimate the likelihood that an invoice will be paid in an acceptable timeframe. In this case, risk is more related to the overdueness of an invoice. Based on this observation, we investigate three possible machine learning tasks for estimating this risk: binary classification for a predetermined overdue days cutoff; regression of the overdue days; and learning-to-rank which learns to optimize the risk-related ranking for the full range of instances. We model and evaluate these tasks using real-life spot factoring data. Finally, we perform a profit-driven evaluation that shows that regression models can lead to higher profits and better spread the risk than classification and ranking models for spot factoring. |
---|---|
ISSN: | 0160-5682 1476-9360 |
DOI: | 10.1080/01605682.2020.1865847 |