Machine learning methods for short-term probability of default: A comparison of classification, regression and ranking methods

Probability of default estimation via machine learning on historical data is widely studied in credit risk modeling. In this work, we investigated the use of machine learning for a finer-grained risk estimation task, namely spot factoring. Here, the goal is to estimate the likelihood that an invoice...

Full description

Saved in:
Bibliographic Details
Published inThe Journal of the Operational Research Society Vol. 73; no. 1; pp. 191 - 206
Main Authors Coenen, Lize, Verbeke, Wouter, Guns, Tias
Format Journal Article
LanguageEnglish
Published Taylor & Francis 02.01.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Probability of default estimation via machine learning on historical data is widely studied in credit risk modeling. In this work, we investigated the use of machine learning for a finer-grained risk estimation task, namely spot factoring. Here, the goal is to estimate the likelihood that an invoice will be paid in an acceptable timeframe. In this case, risk is more related to the overdueness of an invoice. Based on this observation, we investigate three possible machine learning tasks for estimating this risk: binary classification for a predetermined overdue days cutoff; regression of the overdue days; and learning-to-rank which learns to optimize the risk-related ranking for the full range of instances. We model and evaluate these tasks using real-life spot factoring data. Finally, we perform a profit-driven evaluation that shows that regression models can lead to higher profits and better spread the risk than classification and ranking models for spot factoring.
ISSN:0160-5682
1476-9360
DOI:10.1080/01605682.2020.1865847