MEME: Generating RNN Model Explanations via Model Extraction (2012.06954v1)

Published 13 Dec 2020 in cs.LG and cs.AI

Abstract: Recurrent Neural Networks (RNNs) have achieved remarkable performance on a range of tasks. A key step to further empowering RNN-based approaches is improving their explainability and interpretability. In this work we present MEME: a model extraction approach capable of approximating RNNs with interpretable models represented by human-understandable concepts and their interactions. We demonstrate how MEME can be applied to two multivariate, continuous data case studies: Room Occupation Prediction, and In-Hospital Mortality Prediction. Using these case-studies, we show how our extracted models can be used to interpret RNNs both locally and globally, by approximating RNN decision-making via interpretable concept interactions.

Citations (13)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - dmitrykazhdan/MEME-RNN-XAI: MEME: Generating RNN Model Explanations via Model Extraction (15 stars)

MEME: Generating RNN Model Explanations via Model Extraction (2012.06954v1)

Summary

Related Papers

GitHub