Implementing Neural Turing Machines (1807.08518v3)

Published 23 Jul 2018 in cs.LG and stat.ML

Abstract: Neural Turing Machines (NTMs) are an instance of Memory Augmented Neural Networks, a new class of recurrent neural networks which decouple computation from memory by introducing an external memory unit. NTMs have demonstrated superior performance over Long Short-Term Memory Cells in several sequence learning tasks. A number of open source implementations of NTMs exist but are unstable during training and/or fail to replicate the reported performance of NTMs. This paper presents the details of our successful implementation of a NTM. Our implementation learns to solve three sequential learning tasks from the original NTM paper. We find that the choice of memory contents initialization scheme is crucial in successfully implementing a NTM. Networks with memory contents initialized to small constant values converge on average 2 times faster than the next best memory contents initialization scheme.

Authors (2)

Mark Collier (19 papers)
Joeran Beel (42 papers)

Citations (47)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Implementing Neural Turing Machines (1807.08518v3)

Summary

Related Papers