Abstract
Small high-dimensional datasets pose challenges for achieving accurate predictive models, due to issues like overfitting and the curse of dimensionality. While complex models, like deep learning models, have been used to address these challenges, they often lack interpretability and transparency. Explainable Artificial Intelligence (XAI) is a popular field that aims to bridge this gap by developing techniques that provide insights into the decision-making process of machine learning models, therefore increasing the explainability and trustworthiness of models. However, when applying feature selection before model training, less complex models can be used, such that interpretability and explainability are preserved, and explainable AI methods are not even needed. However, in this case, we need an explainable feature selection method. This research presents an iterative feature selection method named IterSHAP, which utilizes a popular XAI technique named SHAP, to increase model performance on small high-dimensional datasets. The performance of IterSHAP was evaluated via both a simulation-based approach and an application-based approach. The results demonstrate the effectiveness of IterSHAP in selecting informative features and improving classification performance on small, high-dimensional datasets.
Original language | English |
---|---|
Title of host publication | Proceedings of the Future Technologies Conference (FTC) 2024, Volume 2 |
Editors | Kohei Arai |
Publisher | Springer |
Pages | 526-545 |
Number of pages | 20 |
ISBN (Print) | 9783031731211 |
DOIs | |
Publication status | Published - 5 Nov 2024 |
Event | 9th Future Technologies Conference, FTC 2024 - London, United Kingdom Duration: 14 Nov 2024 → 15 Nov 2024 Conference number: 9 |
Publication series
Name | Lecture Notes in Networks and Systems |
---|---|
Volume | 1155 LNNS |
ISSN (Print) | 2367-3370 |
ISSN (Electronic) | 2367-3389 |
Conference
Conference | 9th Future Technologies Conference, FTC 2024 |
---|---|
Abbreviated title | FTC 2024 |
Country/Territory | United Kingdom |
City | London |
Period | 14/11/24 → 15/11/24 |
Keywords
- 2025 OA procedure
- Feature selection
- High-dimensional data
- SHAP
- Explainable AI