A TSICN-based Inferential Synthesis Method for Class Imbalance in Credit Scoring

lass imbalance and data obsolescence are two major issues in the field of credit scoring, leading to excessive bias and inaccuracies in the classification process of credit scoring models. In order to augment minority class samples and balance time dependencies, this paper proposes a credit scoring...

Full description

Saved in:
Bibliographic Details
Published in2023 IEEE International Conference on Data Mining (ICDM) pp. 992 - 997
Main Authors Fanab, Dongxu, Feng, Xuanzhi, Jiang, Jinghe, Jiang, Yuming, Zhang, Le, Hu, Dasha
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:lass imbalance and data obsolescence are two major issues in the field of credit scoring, leading to excessive bias and inaccuracies in the classification process of credit scoring models. In order to augment minority class samples and balance time dependencies, this paper proposes a credit scoring model based on Temporal Sample Interaction Convolutional Network (TSICN) to facilitate better credit risk assessment for financial institutions. It relies on the intrinsic features of the minority class samples to synthesize new data, thereby increasing the quantity of the minority class samples. Through inference and synthesis, It greatly mitigates the loss of information from minority class samples. The synthesized minority class samples, blended with the original data, are inputted into the causal convolutional layers and dilated convolutional layers. The information flow and memory updates are regulated through reset gate and update gate, The reset gate determines how to combine past information with the current input, while the update gate determines how to combine the previous hidden state with the current candidate hidden state. Compared to common methods, TSICN can integrate information features from minority class samples into the synthesis data, focusing more on short-term dependencies while reducing the capture of long-term dependencies. Experimental results show that TSICN achieves excellent credit scoring classification performance on real-world datasets. This enables more accurate prediction of applicant credit risk, thus reducing the risk of loan default.
ISSN:2374-8486
DOI:10.1109/ICDM58522.2023.00113