Machine learning methods for short-term probability of default: A comparison of classification, regression and ranking methods

Probability of default estimation via machine learning on historical data is widely studied in credit risk modeling. In this work, we investigated the use of machine learning for a finer-grained risk estimation task, namely spot factoring. Here, the goal is to estimate the likelihood that an invoice...

Full description

Saved in:

Bibliographic Details
Published in	The Journal of the Operational Research Society Vol. 73; no. 1; pp. 191 - 206
Main Authors	Coenen, Lize, Verbeke, Wouter, Guns, Tias
Format	Journal Article
Language	English
Published	Taylor & Francis 02.01.2022
Subjects	credit risk learning-to-rank Machine learning probability of default spot factoring
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Probability of default estimation via machine learning on historical data is widely studied in credit risk modeling. In this work, we investigated the use of machine learning for a finer-grained risk estimation task, namely spot factoring. Here, the goal is to estimate the likelihood that an invoice will be paid in an acceptable timeframe. In this case, risk is more related to the overdueness of an invoice. Based on this observation, we investigate three possible machine learning tasks for estimating this risk: binary classification for a predetermined overdue days cutoff; regression of the overdue days; and learning-to-rank which learns to optimize the risk-related ranking for the full range of instances. We model and evaluate these tasks using real-life spot factoring data. Finally, we perform a profit-driven evaluation that shows that regression models can lead to higher profits and better spread the risk than classification and ranking models for spot factoring.
ISSN:	0160-5682 1476-9360
DOI:	10.1080/01605682.2020.1865847