Context-Free Transductions with Neural Stacks (1809.02836v1)
Abstract: This paper analyzes the behavior of stack-augmented recurrent neural network (RNN) models. Due to the architectural similarity between stack RNNs and pushdown transducers, we train stack RNN models on a number of tasks, including string reversal, context-free language modelling, and cumulative XOR evaluation. Examining the behavior of our networks, we show that stack-augmented RNNs can discover intuitive stack-based strategies for solving our tasks. However, stack RNNs are more difficult to train than classical architectures such as LSTMs. Rather than employing stack-based strategies, more complex networks often find approximate solutions by using the stack as unstructured memory.
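For context, stack-augmented RNNs in this line of work typically pair an RNN controller with a continuous stack in the style of Grefenstette et al. (2015), where discrete push and pop operations are relaxed to real-valued strengths so the whole model stays differentiable. The sketch below is a minimal, assumed illustration of that mechanism, not the authors' code; the class name NeuralStack and its step interface are hypothetical.

```python
# Minimal sketch of a continuous stack (after Grefenstette et al., 2015),
# the kind of differentiable memory a stack RNN's controller drives.
# Assumed interface: at each timestep the controller emits a push value v,
# a push strength d, and a pop strength u (d, u in [0, 1] via sigmoids).
import torch


class NeuralStack:
    """Differentiable stack: vectors paired with real-valued strengths."""

    def __init__(self, dim):
        self.dim = dim
        self.values = []     # (dim,) tensors, bottom to top
        self.strengths = []  # scalar tensors in [0, 1]

    def step(self, v, d, u):
        """Pop with strength u, push v with strength d, return the read vector."""
        # Pop: consume strength u from the top of the stack downward.
        remaining = u
        popped = []
        for s in reversed(self.strengths):
            popped.append(torch.clamp(s - torch.clamp(remaining, min=0.0), min=0.0))
            remaining = remaining - s
        self.strengths = list(reversed(popped))
        # Push: place v on top with strength d.
        self.values.append(v)
        self.strengths.append(d)
        # Read: a strength-1 "slice" taken from the top downward.
        r = torch.zeros(self.dim)
        budget = torch.tensor(1.0)
        for s, val in zip(reversed(self.strengths), reversed(self.values)):
            r = r + torch.minimum(s, torch.clamp(budget, min=0.0)) * val
            budget = budget - s
        return r


# Usage: with full push strength and no popping, the read returns the top.
stack = NeuralStack(dim=2)
stack.step(torch.tensor([1.0, 0.0]), d=torch.tensor(1.0), u=torch.tensor(0.0))
r = stack.step(torch.tensor([0.0, 1.0]), d=torch.tensor(1.0), u=torch.tensor(0.0))
print(r)  # tensor([0., 1.])
```

Because d and u are continuous, the network can also blur push and pop to treat the structure as soft, unstructured memory, which is the failure mode the abstract describes for more complex networks.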