IterSHAP: An XAI-Based Feature Selection Method for Small High-Dimensional Datasets

Frank van Mourik, Maryam Amir Haeri, Faiza A. Bukhsh, Faizan Ahmed*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Small high-dimensional datasets pose challenges for achieving accurate predictive models, due to issues like overfitting and the curse of dimensionality. While complex models, like deep learning models, have been used to address these challenges, they often lack interpretability and transparency. Explainable Artificial Intelligence (XAI) is a popular field that aims to bridge this gap by developing techniques that provide insights into the decision-making process of machine learning models, therefore increasing the explainability and trustworthiness of models. However, when applying feature selection before model training, less complex models can be used, such that interpretability and explainability are preserved, and explainable AI methods are not even needed. However, in this case, we need an explainable feature selection method. This research presents an iterative feature selection method named IterSHAP, which utilizes a popular XAI technique named SHAP, to increase model performance on small high-dimensional datasets. The performance of IterSHAP was evaluated via both a simulation-based approach and an application-based approach. The results demonstrate the effectiveness of IterSHAP in selecting informative features and improving classification performance on small, high-dimensional datasets.

Original languageEnglish
Title of host publicationProceedings of the Future Technologies Conference (FTC) 2024, Volume 2
EditorsKohei Arai
PublisherSpringer
Pages526-545
Number of pages20
ISBN (Print)9783031731211
DOIs
Publication statusPublished - 5 Nov 2024
Event9th Future Technologies Conference, FTC 2024 - London, United Kingdom
Duration: 14 Nov 202415 Nov 2024
Conference number: 9

Publication series

NameLecture Notes in Networks and Systems
Volume1155 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference9th Future Technologies Conference, FTC 2024
Abbreviated titleFTC 2024
Country/TerritoryUnited Kingdom
CityLondon
Period14/11/2415/11/24

Keywords

  • 2025 OA procedure
  • Feature selection
  • High-dimensional data
  • SHAP
  • Explainable AI

Fingerprint

Dive into the research topics of 'IterSHAP: An XAI-Based Feature Selection Method for Small High-Dimensional Datasets'. Together they form a unique fingerprint.

Cite this