- The paper demonstrates that saliency maps slightly improve user prediction accuracy for CNN classifications, reaching a 60.7% success rate.
- The study employs an online between-group experiment with 64 participants using Layer-wise Relevance Propagation to generate interpretable heat maps.
- It highlights that while saliency maps effectively pinpoint influential features, they fall short in helping users correctly identify misclassifications, suggesting a need for more holistic explanation tools.
Insights from Evaluating Saliency Map Explanations for Convolutional Neural Networks
The paper by Alqaraawi et al. examines the utility and limitations of saliency maps as an explanation tool for convolutional neural networks (CNNs) used in multi-label image classification tasks. As CNNs excel in a wide range of machine learning applications, understanding their decision-making process remains a significant challenge, especially for users who are not experts in the field. The study addresses this by evaluating whether saliency maps, which indicate which parts of an image contribute most to a CNN's classification decision, help users predict how the network will classify unseen images.
Methodology
The authors conducted an online between-group user study in which 64 participants were asked to predict the classification output of a CNN trained on the PASCAL VOC 2012 dataset. This dataset consists of images containing multiple objects, which the CNN was tasked with identifying. Participants were exposed to varying levels of information: some saw only the original image, while others also had access to saliency maps and classification scores. The saliency maps were generated with the Layer-wise Relevance Propagation (LRP) technique, known for producing readily interpretable heat maps.
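For readers who want a concrete picture of what a pixel-level attribution map looks like in code, the minimal sketch below computes a plain gradient saliency map with a pretrained torchvision VGG16. This is a hedged stand-in rather than the paper's setup: the study used LRP (a different attribution method) and a CNN trained on PASCAL VOC 2012, and the file name and class index in the usage line are hypothetical.

```python
import torch
from PIL import Image
from torchvision import models, transforms

# Plain gradient saliency as a stand-in for the LRP heat maps used in the study.
# VGG16 with ImageNet weights is an assumption for illustration only.
model = models.vgg16(weights=models.VGG16_Weights.DEFAULT).eval()

preprocess = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

def saliency_map(image_path: str, target_class: int) -> torch.Tensor:
    """Return an (H, W) map of |d class score / d pixel| for one class."""
    x = preprocess(Image.open(image_path).convert("RGB")).unsqueeze(0)
    x.requires_grad_(True)
    scores = model(x)                   # shape (1, 1000) for the ImageNet head
    scores[0, target_class].backward()  # gradient of a single class score
    # Collapse colour channels: strongest absolute gradient per pixel.
    return x.grad.abs().max(dim=1).values.squeeze(0)

# Hypothetical usage: heat = saliency_map("dog.jpg", target_class=243)
```

The resulting tensor can be rendered as a heat map overlay, which is roughly the kind of visual stimulus participants in the saliency-map conditions were shown.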
Results
The paper reports that the presence of saliency maps marginally increased the accuracy of users' predictions, reaching a 60.7% success rate compared to a lower rate for participants without the maps. Despite this improvement, participants' ability to anticipate misclassifications, such as false positives and false negatives, remained weak, with accuracy on these cases staying below 50%. The research underscores that saliency maps prompt users to focus on the highlighted features, potentially aiding feature recognition without necessarily broadening their overall understanding of how the CNN behaves.
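To make the accuracy breakdown concrete: a participant counts as correct when their prediction matches the network's output for a label, and trials are grouped by how that output compares to the ground truth. The sketch below tallies accuracy this way on invented example responses; the records and resulting numbers are illustrative only, not the study's data.

```python
from collections import defaultdict

# Invented trials for illustration only (not the study's data).
# Each tuple: (network predicts label present, label truly present,
#              participant predicted the network would say "present")
trials = [
    (True,  True,  True),    # network true positive;  participant correct
    (True,  True,  False),   # network true positive;  participant wrong
    (True,  False, False),   # network false positive; participant wrong
    (False, True,  False),   # network false negative; participant correct
    (False, False, False),   # network true negative;  participant correct
    (False, False, True),    # network true negative;  participant wrong
]

def category(network_out: bool, ground_truth: bool) -> str:
    if network_out:
        return "true positive" if ground_truth else "false positive"
    return "false negative" if ground_truth else "true negative"

correct, total = defaultdict(int), defaultdict(int)
for network_out, ground_truth, participant_pred in trials:
    cat = category(network_out, ground_truth)
    total[cat] += 1
    # A prediction is scored against the network's output, not the ground truth.
    correct[cat] += int(participant_pred == network_out)

print(f"overall: {sum(correct.values()) / sum(total.values()):.1%}")
for cat, n in total.items():
    print(f"{cat}: {correct[cat] / n:.1%}")
```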
Implications and Limitations
The findings highlight the potential yet limited role of saliency maps in improving user comprehension of CNNs. While saliency maps can draw attention to features that matter for a CNN's decision, they may inadvertently lead users to neglect other important aspects such as image context or quality. This calls for more comprehensive explanation systems that also incorporate global image attributes, helping users assemble a more holistic picture of CNN outputs. The prominence of highlighted features further points to the need for complementary explanatory approaches that quantify feature importance across varied contexts.
Future Directions
The paper suggests several avenues for future research, including better algorithmic strategies for selecting representative example images and probing the utility of saliency maps across different model architectures and datasets. It advocates pairing instance-based explanations such as saliency maps with more global explanation metrics to support a deeper understanding of model behavior, particularly on complex image datasets.
In conclusion, while saliency maps hold promise as part of the toolbox for explainable AI, their effectiveness in isolation is insufficient for fully demystifying CNN classification mechanisms. The interplay between human cognitive biases, machine learning processes, and explanation modalities remains an expansive area for exploration, requiring sophisticated techniques that bridge the gap between machine outputs and human interpretation.