TY - JOUR
T1 - Green AI in the Finance Industry
T2 - Exploring the Impact of Feature Engineering on the Accuracy and Computational Time of Machine Learning Models
AU - Machado, Marcos
AU - Asadi, Amin
AU - William R. de Souza, Renato
AU - Ugulino, Wallace
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/12
Y1 - 2024/12
N2 - As research and practice on applications of Artificial Intelligence (AI) exponentially increase, the support for deployment grows at the same rate. While a large amount of data available enables sophisticated methods to perform feature engineering, reaching higher accuracy, it is imperative to emphasize the computational costs and the efficiency level in which these models operate. This paper contrasts the processing time and accuracy of individual and hybrid Machine Learning (ML) models obtained when predicting customer loyalty in financial settings. We use frameworks that account for feature engineering and green AI philosophy aspects separately within the individual and hybrid proposed approaches. The individual models refer to commonly used regressor-based algorithms (e.g., decision trees, gradient boosting, and LightGBM) widely applied in business problems. The hybrid models use k-Means to cluster customers before implementing the individual regressor-based models. Our findings indicate that using a lower number of features results in a slightly smaller accuracy than models incorporating features. Besides, we explicitly illustrate the trade-off between the higher accuracy and computational time of the hybrid ML models against the lower accuracy and computational time of the individual models when assessing customers' loyalty levels. Thus, our results provide managers with information regarding the model to be deployed based on their firms' specifications.
AB - As research and practice on applications of Artificial Intelligence (AI) exponentially increase, the support for deployment grows at the same rate. While a large amount of data available enables sophisticated methods to perform feature engineering, reaching higher accuracy, it is imperative to emphasize the computational costs and the efficiency level in which these models operate. This paper contrasts the processing time and accuracy of individual and hybrid Machine Learning (ML) models obtained when predicting customer loyalty in financial settings. We use frameworks that account for feature engineering and green AI philosophy aspects separately within the individual and hybrid proposed approaches. The individual models refer to commonly used regressor-based algorithms (e.g., decision trees, gradient boosting, and LightGBM) widely applied in business problems. The hybrid models use k-Means to cluster customers before implementing the individual regressor-based models. Our findings indicate that using a lower number of features results in a slightly smaller accuracy than models incorporating features. Besides, we explicitly illustrate the trade-off between the higher accuracy and computational time of the hybrid ML models against the lower accuracy and computational time of the individual models when assessing customers' loyalty levels. Thus, our results provide managers with information regarding the model to be deployed based on their firms' specifications.
KW - UT-Hybrid-D
KW - Green AI
KW - Machine Learning
KW - Hybrid Machine Learning
KW - Customer Loyalty
KW - Finance Industry
KW - Feature Engineering
UR - http://www.scopus.com/inward/record.url?scp=85206976545&partnerID=8YFLogxK
U2 - 10.1016/j.asoc.2024.112343
DO - 10.1016/j.asoc.2024.112343
M3 - Article
SN - 1568-4946
VL - 167
JO - Applied Soft Computing Journal
JF - Applied Soft Computing Journal
M1 - 112343
ER -