A memory enhanced LSTM for modeling complex temporal dependencies (1910.12388v1)

Published 25 Oct 2019 in cs.LG and stat.ML

Abstract: In this paper, we present Gamma-LSTM, an enhanced long short-term memory (LSTM) unit, to enable learning of hierarchical representations through multiple stages of temporal abstractions. Gamma memory, a hierarchical memory unit, forms the central memory of Gamma-LSTM, with gates to regulate the information flow into various levels of hierarchy, thus giving the unit control over which level of hierarchy processes the input at a given instant of time. We demonstrate better performance of the Gamma-LSTM model over regular and stacked LSTMs in two settings (pixel-by-pixel MNIST digit classification and natural language inference), placing emphasis on the ability to generalize over long sequences.
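
To make the "pixel-by-pixel" setting concrete, the sketch below shows one common way to turn a 28x28 MNIST image into a 784-step sequence with one feature per step, which is what makes the task a test of long-range dependencies. This is only an illustration under assumptions: the row-major scan order, the helper name `image_to_pixel_sequence`, and the all-zero stand-in image are hypothetical and not taken from the paper.

```python
def image_to_pixel_sequence(image):
    """Flatten a 2D image (a list of rows) into a list of time steps.

    Each time step holds a single feature: one pixel value.
    Assumption: pixels are read row by row, left to right.
    """
    return [[float(pixel)] for row in image for pixel in row]


# Hypothetical stand-in for an MNIST digit: a 28x28 grid of zeros.
image = [[0] * 28 for _ in range(28)]
sequence = image_to_pixel_sequence(image)

# A recurrent model would consume this sequence one pixel per step.
print(len(sequence), len(sequence[0]))  # 784 1
```

A recurrent unit fed this sequence sees 784 inputs before it emits a class label, so information from the earliest pixels has to survive hundreds of updates.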

Related Papers

xLSTM: Extended Long Short-Term Memory (2024)
Working Memory Connections for LSTM (2021)
Cell-aware Stacked LSTMs for Modeling Sentences (2018)
Grow and Prune Compact, Fast, and Accurate LSTMs (2018)
Nested LSTMs (2018)