Explaining Predictive Uncertainty by Looking Back at Model Explanations (2201.03742v2)

Published 11 Jan 2022 in cs.CL

Abstract: Predictive uncertainty estimation of pre-trained LLMs is an important measure of how likely people can trust their predictions. However, little is known about what makes a model prediction uncertain. Explaining predictive uncertainty is an important complement to explaining prediction labels in helping users understand model decision making and gaining their trust on model predictions, while has been largely ignored in prior works. In this work, we propose to explain the predictive uncertainty of pre-trained LLMs by extracting uncertain words from existing model explanations. We find the uncertain words are those identified as making negative contributions to prediction labels, while actually explaining the predictive uncertainty. Experiments show that uncertainty explanations are indispensable to explaining models and helping humans understand model prediction behavior.

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Explaining Predictive Uncertainty by Looking Back at Model Explanations (2201.03742v2)

Summary

Related Papers