Fairness in credit scoring: Assessment, implementation and profit implications

•A systematic overview of fairness implementation techniques is provided.•The adequacy of established fairness criteria for credit scoring is examined.•Empirical experiments on seven data sets analyze agreement of fairness criteria.•Promising fairness processors are identified based on the empirical...

Full description

Saved in:
Bibliographic Details
Published inEuropean journal of operational research Vol. 297; no. 3; pp. 1083 - 1094
Main Authors Kozodoi, Nikita, Jacob, Johannes, Lessmann, Stefan
Format Journal Article
LanguageEnglish
Published Elsevier B.V 16.03.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•A systematic overview of fairness implementation techniques is provided.•The adequacy of established fairness criteria for credit scoring is examined.•Empirical experiments on seven data sets analyze agreement of fairness criteria.•Promising fairness processors are identified based on the empirical results.•Analysis of the Pareto frontiers examines the profit-fairness trade-off. [Display omitted] The rise of algorithmic decision-making has spawned much research on fair machine learning (ML). Financial institutions use ML for building risk scorecards that support a range of credit-related decisions. Yet, the literature on fair ML in credit scoring is scarce. The paper makes three contributions. First, we revisit statistical fairness criteria and examine their adequacy for credit scoring. Second, we catalog algorithmic options for incorporating fairness goals in the ML model development pipeline. Last, we empirically compare different fairness processors in a profit-oriented credit scoring context using real-world data. The empirical results substantiate the evaluation of fairness measures, identify suitable options to implement fair credit scoring, and clarify the profit-fairness trade-off in lending decisions. We find that multiple fairness criteria can be approximately satisfied at once and recommend separation as a proper criterion for measuring the fairness of a scorecard. We also find fair in-processors to deliver a good balance between profit and fairness and show that algorithmic discrimination can be reduced to a reasonable level at a relatively low cost. The codes corresponding to the paper are available on GitHub.
ISSN:0377-2217
1872-6860
DOI:10.1016/j.ejor.2021.06.023