TY - JOUR
T1 - Predicting retail customers' distress in the finance industry
T2 - An early warning system approach
AU - Beltman, Jaap
AU - Machado, Marcos R.
AU - Osterrieder, Joerg R.
N1 - Publisher Copyright:
© 2024 The Author(s)
PY - 2025/1
Y1 - 2025/1
N2 - Predicting credit defaults is crucial for financial institutions to assess risk and make informed lending decisions. One of the most recent strategies banks and financial institutions have been testing to minimize losses that arise from credit default is the deployment of Early Warning Systems (EWS). By nature, this technique was primarily proposed and explored for commercial customers. However, this study proposes a comprehensive data-driven approach to model Early Warning Systems (EWS) for retail customers in the financial industry while using different Machine Learning (ML) models. We use Logistic Regression (LR), Gradient Boosting (GB), and Random Forest (RF) to classify customers' status, indicating the need to include potential default in a “watch list”. Additionally, we implement a fourth model (i.e., meta-model), whose predictions are based on the output of the other algorithms used (LR, GB, RF). Results indicate that the meta-model achieves higher accuracy than GB or any other individual model tested. From the management perspective, the findings indicate that a higher threshold for warning signals results in alerts closer to the overdue date, indicating increased sensitivity to emerging client deterioration. Conversely, lower thresholds focus more on the client's overall status. Furthermore, using the top ten features for training yields satisfactory overall results, but incorporating features beyond the top ten provides valuable supplementary information to be used in the decision-making process.
AB - Predicting credit defaults is crucial for financial institutions to assess risk and make informed lending decisions. One of the most recent strategies banks and financial institutions have been testing to minimize losses that arise from credit default is the deployment of Early Warning Systems (EWS). By nature, this technique was primarily proposed and explored for commercial customers. However, this study proposes a comprehensive data-driven approach to model Early Warning Systems (EWS) for retail customers in the financial industry while using different Machine Learning (ML) models. We use Logistic Regression (LR), Gradient Boosting (GB), and Random Forest (RF) to classify customers' status, indicating the need to include potential default in a “watch list”. Additionally, we implement a fourth model (i.e., meta-model), whose predictions are based on the output of the other algorithms used (LR, GB, RF). Results indicate that the meta-model achieves higher accuracy than GB or any other individual model tested. From the management perspective, the findings indicate that a higher threshold for warning signals results in alerts closer to the overdue date, indicating increased sensitivity to emerging client deterioration. Conversely, lower thresholds focus more on the client's overall status. Furthermore, using the top ten features for training yields satisfactory overall results, but incorporating features beyond the top ten provides valuable supplementary information to be used in the decision-making process.
KW - UT-Hybrid-D
KW - Finance industry
KW - Machine learning
KW - Meta-model
KW - Retail customers
KW - Early warning systems
UR - http://www.scopus.com/inward/record.url?scp=85206329469&partnerID=8YFLogxK
U2 - 10.1016/j.jretconser.2024.104101
DO - 10.1016/j.jretconser.2024.104101
M3 - Article
AN - SCOPUS:85206329469
SN - 0969-6989
VL - 82
JO - Journal of Retailing and Consumer Services
JF - Journal of Retailing and Consumer Services
M1 - 104101
ER -