A comparison of data mining techniques in evaluating retail credit scoring using R programming

Retail credit scoring has become more efficient in recent years because of the use of data mining techniques that allow marketing officers and top managers to better estimate their customers credibility. In recent years, many complicated models have been developed; however there are few of them whic...

Full description

Saved in:

Bibliographic Details
Published in	2015 Twelve International Conference on Electronics Computer and Computation (ICECCO) pp. 1 - 4
Main Authors	Zakirov, Dilmurat, Bondarev, Aleksey, Momtselidze, Nodar
Format	Conference Proceeding
Language	English
Published	IEEE 01.09.2015
Subjects	Boosting Computational modeling credit scoring Data mining kNN Predictive models R programming random forest Regression tree analysis Support vector machines Vegetation
Online Access	Get full text
DOI	10.1109/ICECCO.2015.7416867

Cover

More Information
Summary:	Retail credit scoring has become more efficient in recent years because of the use of data mining techniques that allow marketing officers and top managers to better estimate their customers credibility. In recent years, many complicated models have been developed; however there are few of them which continues to be used because of its efficiency and simplicity. This study investigates k-Nearest Neighbourhood (kNN), support vector machines (SVMs), gradient boosted model (GBM), Naive Bayes classification, Classification and Regression Tree (CART) and Random Forest (RF) as analytical methods for customer credit scoring estimation and evaluation, using real dataset. At the end of the study it is found that Random Forest model with down-sampling (RF_US) has better accuracy rate when compared to other models.
DOI:	10.1109/ICECCO.2015.7416867