Neuromorphic dreaming: A pathway to efficient learning in artificial agents (2405.15616v1)

Published 24 May 2024 in cs.AI, cs.LG, and cs.NE

Abstract: Achieving energy efficiency in learning is a key challenge for AI computing platforms. Biological systems demonstrate remarkable abilities to learn complex skills quickly and efficiently. Inspired by this, we present a hardware implementation of model-based reinforcement learning (MBRL) using spiking neural networks (SNNs) on mixed-signal analog/digital neuromorphic hardware. This approach leverages the energy efficiency of mixed-signal neuromorphic chips while achieving high sample efficiency through an alternation of online learning, referred to as the "awake" phase, and offline learning, known as the "dreaming" phase. The model proposed includes two symbiotic networks: an agent network that learns by combining real and simulated experiences, and a learned world model network that generates the simulated experiences. We validate the model by training the hardware implementation to play the Atari game Pong. We start from a baseline consisting of an agent network learning without a world model and dreaming, which successfully learns to play the game. By incorporating dreaming, the number of required real game experiences are reduced significantly compared to the baseline. The networks are implemented using a mixed-signal neuromorphic processor, with the readout layers trained using a computer in-the-loop, while the other layers remain fixed. These results pave the way toward energy-efficient neuromorphic learning systems capable of rapid learning in real world applications and use-cases.

References (28)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/mr__py/status/1794997349162557734

https://twitter.com/DarylC71/status/1795088616219099486

HackerNews

Neuromorphic dreaming: A pathway to efficient learning in artificial agents (2 points, 0 comments)

Neuromorphic dreaming: A pathway to efficient learning in artificial agents (2405.15616v1)

Summary

Related Papers

Tweets

HackerNews