Machine Learning model based on the Light Gradient Boosting Machine to predict the probability of default in customers of the Credit Card portfolio

Authors

  • Eduardo Rafael Jáuregui Romero, Universidad Nacional Mayor de San Marcos, Lima, Peru

DOI:

https://doi.org/10.15381/risi.v16i2.27140

Keywords:

Credit risk, machine learning, data analysis, probability of default, credit card, proof of payment

Abstract

Bank credit has become a widely used means of payment, and more and more people are accessing products such as credit cards and loans. Banks have implemented classic prediction models, the vast majority based on logistic regression, because it makes the model and the effect of its variables highly interpretable for the business. The purpose of this research is to perform a predictive analysis of the probability of customer default in the credit card portfolio using a risk score. The dataset used is the "default of credit card clients" data set from the UCI Machine Learning Repository. The approach is quantitative and the methodology is descriptive analytics; techniques based on gradient boosting are used to make the prediction. The trained algorithms include Logistic Regression with Weight of Evidence (WOE), XGBoost, CatBoost, and LightGBM. The best result was obtained with the Light Gradient Boosting Machine (LightGBM) tuned with a Bayesian search, which reached a Gini of 57.4, an improvement of 6 points over Logistic Regression with WOE and of 3 points over XGBoost and CatBoost. Finally, the Gain and Shapley values made up for the model's lack of interpretability at the variable level, allowing better decision making when evaluating clients. As future work, the plan is to add unstructured variables to improve the model's indicators.
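The abstract reports model quality as a Gini value. It does not state how the Gini was computed; a common convention in credit scoring, assumed here, is Gini = 2·AUC − 1, expressed in points on a 0 to 100 scale. Under that assumption, a Gini of 57.4 would correspond to an AUC of about 0.787. The following minimal sketch illustrates the calculation; the function names and the toy data are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def auc_score(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUC as the probability that a random defaulter (label 1) receives a
    higher score than a random non-defaulter (label 0); ties count as half."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    # Pairwise comparison: O(n_pos * n_neg), fine for a small illustration.
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def gini_points(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """Gini in points (0 to 100 for a model that ranks better than random),
    using the assumed convention Gini = 2 * AUC - 1."""
    return 100.0 * (2.0 * auc_score(y_true, y_score) - 1.0)


# Toy data (hypothetical, not from the paper): 1 = default, 0 = no default.
labels = [0, 0, 1, 0, 1, 1]
scores = [0.10, 0.40, 0.35, 0.20, 0.80, 0.70]
toy_gini = gini_points(labels, scores)
```

For real portfolios, a library routine such as scikit-learn's `roc_auc_score` would normally replace the pairwise loop; the loop is used here only to keep the sketch dependency-free.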

Published

2023-12-30

Issue

Vol. 16 No. 2 (2023)

Section

Original Research Articles

How to Cite

[1]
E. R. Jáuregui Romero, “Machine Learning model based on the Light Gradient Boosting Machine to predict the probability of default in customers of the Credit Card portfolio”, Rev.Investig.sist.inform., vol. 16, no. 2, pp. 155–168, Dec. 2023, doi: 10.15381/risi.v16i2.27140.