Pixel-based clustering for local interpretable model-agnostic explanations

Szczegóły
Abstrakt

Tytuł:: Pixel-based clustering for local interpretable model-agnostic explanations
Autorzy:: Qian, Junyan
Wen, Tong
Ling, Ming
Du, Xiaofu
Ding, Hao
Data publikacji:: 2025
Słowa kluczowe:: interpretability
explainable artificial intelligence
model-agnostic explanations
black-box model
Język:: angielski
Dostawca treści:: BazTech
: Artykuł

To enhance the interpretability of black-box machine learning, model-agnostic explanations have become a focal point of interest. This paper introduces Pixel-based Local Interpretable Model-agnostic Explanations (PLIME), a method that generates perturbation samples via pixel clustering to derive raw explanations. Through iterative refinement, it reduces the number of features, culminating in an optimal feature set that best predicts the model’s score. PLIME increases the relevance of features associated with correct predictions in the explanations. A comprehensive evaluation of PLIME is conducted against LIME and SHAP, focusing on faithfulness, stability, and minimality. Additionally, the predictions from both PLIME and LIME are utilized in Random Input Sampling Explanations (RISE) to compare minimality. The results demonstrate PLIME’s significant advantages in stability, faithfulness, and minimality.

Opracowanie rekordu ze środków MNiSW, umowa nr POPUL/SP/0154/2024/02 w ramach programu "Społeczna odpowiedzialność nauki II" - moduł: Popularyzacja nauki (2025).

Informacja

Pixel-based clustering for local interpretable model-agnostic explanations