
Neuromorphic dreaming: A pathway to efficient learning in artificial agents (2405.15616v1)

Published 24 May 2024 in cs.AI, cs.LG, and cs.NE

Abstract: Achieving energy efficiency in learning is a key challenge for AI computing platforms. Biological systems demonstrate remarkable abilities to learn complex skills quickly and efficiently. Inspired by this, we present a hardware implementation of model-based reinforcement learning (MBRL) using spiking neural networks (SNNs) on mixed-signal analog/digital neuromorphic hardware. This approach leverages the energy efficiency of mixed-signal neuromorphic chips while achieving high sample efficiency through an alternation of online learning, referred to as the "awake" phase, and offline learning, known as the "dreaming" phase. The proposed model includes two symbiotic networks: an agent network that learns by combining real and simulated experiences, and a learned world model network that generates the simulated experiences. We validate the model by training the hardware implementation to play the Atari game Pong. We start from a baseline consisting of an agent network that learns without a world model or dreaming, and which successfully learns to play the game. By incorporating dreaming, the number of real game experiences required is reduced significantly compared to the baseline. The networks are implemented using a mixed-signal neuromorphic processor, with the readout layers trained using a computer in the loop, while the other layers remain fixed. These results pave the way toward energy-efficient neuromorphic learning systems capable of rapid learning in real-world applications and use cases.
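
The awake/dreaming alternation described in the abstract can be illustrated with a minimal software sketch. This is not the authors' implementation: it uses no spiking neurons or neuromorphic hardware, and the toy 1-D environment, the `tanh` random-feature layer, the REINFORCE-style policy update, the delta-rule world-model update, and all names and hyperparameters (`train`, `env_step`, `n_hidden`, `lr`, and so on) are assumptions chosen for brevity. It mirrors only two ideas from the abstract: the hidden layer stays fixed while linear readouts are trained, and the agent learns from both real steps and steps simulated by a learned world model.

```python
import math
import random

def features(x, proj):
    """Fixed random hidden layer (never trained): tanh of a random projection."""
    return [math.tanh(w * x + b) for w, b in proj]

def readout(weights, feats):
    """Trainable linear readout on top of the fixed features."""
    return sum(w * f for w, f in zip(weights, feats))

def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))

def env_step(x, action):
    """Toy 1-D task (hypothetical stand-in for Pong): keep x close to 0."""
    nxt = max(-1.0, min(1.0, x + 0.1 * action))
    return nxt, -abs(nxt)

def train(n_cycles=20, awake_steps=10, dream_steps=10, n_hidden=16, lr=0.05, seed=0):
    rng = random.Random(seed)
    proj = [(rng.uniform(-2, 2), rng.uniform(-1, 1)) for _ in range(n_hidden)]
    policy_w = [0.0] * n_hidden                          # agent readout
    state_w = {a: [0.0] * n_hidden for a in (-1, 1)}     # world model: next state
    reward_w = {a: [0.0] * n_hidden for a in (-1, 1)}    # world model: reward
    counts = {"real": 0, "dreamed": 0}

    def policy_update(x, step_fn):
        """Sample an action, take one step via step_fn, apply a REINFORCE-style update."""
        f = features(x, proj)
        p = sigmoid(readout(policy_w, f))                # probability of action +1
        a = 1 if rng.random() < p else -1
        nxt, r = step_fn(x, a)
        g = (1.0 if a == 1 else 0.0) - p                 # d log pi / d logit
        for i in range(n_hidden):
            policy_w[i] += lr * r * g * f[i]
        return f, a, nxt, r

    def dream_step(x, a):
        """Simulated transition produced by the learned world model."""
        f = features(x, proj)
        nxt = max(-1.0, min(1.0, readout(state_w[a], f)))
        return nxt, readout(reward_w[a], f)

    for _ in range(n_cycles):
        # Awake phase: real experience trains both the agent and the world model.
        x = rng.uniform(-1, 1)
        for _ in range(awake_steps):
            f, a, nxt, r = policy_update(x, env_step)
            for w, target in ((state_w[a], nxt), (reward_w[a], r)):
                err = target - readout(w, f)             # delta-rule readout update
                for i in range(n_hidden):
                    w[i] += lr * err * f[i]
            x = nxt
            counts["real"] += 1
        # Dreaming phase: the agent keeps learning from simulated experience only.
        x = rng.uniform(-1, 1)
        for _ in range(dream_steps):
            _, _, x, _ = policy_update(x, dream_step)
            counts["dreamed"] += 1
    return policy_w, counts
```

Calling `policy_w, counts = train()` returns the trained agent readout together with the number of real and dreamed steps consumed. In this sketch, raising `dream_steps` adds policy updates without adding real environment interactions, which is the sample-efficiency trade-off the abstract refers to; whether it helps on a given task depends on how accurate the learned world model is.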

