Skip to main navigation Skip to search Skip to main content

Advancing credit risk assessment in the retail banking industry: A hybrid approach using time series and supervised learning models

  • Sebastian H. Goldmann
  • , Marcos R. Machado*
  • , Joerg R. Osterrieder
  • *Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

24 Downloads (Pure)

Abstract

Credit risk assessment remains a central challenge in retail banking, with conventional models often falling short in predictive accuracy and adaptability to granular customer behavior. This study explores the potential of Time Series Classification (TSC) algorithms to enhance credit risk modeling by analyzing customers’ historical end-of-day balance data. We compare traditional Machine Learning (ML) models – including Logistic Regression and XGBoost – with advanced TSC methods such as Shapelets, Long Short-Term Memory (LSTM) networks, and Canonical Interval Forests (CIF). Our results show that TSC algorithms, particularly CIF and Shapelet-based methods, significantly outperform traditional approaches. When using CIF-derived Probability of Default (PD) estimates as additional features in an XGBoost model, predictive performance improved notably: the combined model achieved an Area under the Curve (AUC) of 0.81, compared to 0.79 for CIF alone and 0.77 for XGBoost without the CIF input. These findings underscore the value of integrating temporal features into credit risk assessment frameworks. Moreover, the complementary strengths of the TSC and XGBoost models across different Receiver Operating Characteristic (ROC) curve regions demonstrate the practical benefits of model stacking. However, performance dropped when using aggregated monthly data, highlighting the importance of preserving high-frequency behavioral signals. This research contributes to more accurate, interpretable, and robust credit risk models and offers a pathway for banks to leverage time series data for improved risk forecasting.

Original languageEnglish
Article number102490
JournalData and Knowledge Engineering
Volume160
DOIs
Publication statusPublished - Nov 2025

Keywords

  • UT-Hybrid-D
  • Hybrid Machine Learning
  • Probability of default
  • Retail banking
  • Time series classification
  • Consumers’ credit risk assessment

Fingerprint

Dive into the research topics of 'Advancing credit risk assessment in the retail banking industry: A hybrid approach using time series and supervised learning models'. Together they form a unique fingerprint.

Cite this