Learning from Neuronal Populations
Author Information
Author(s): Friedrich Johannes Urbanczik, Robert Senn
Primary Institution: Department of Physiology, University of Bern, Bern, Switzerland
Hypothesis
How can synapses adapt when reward delivery is delayed and depends on the releases of many other synapses?
Conclusion
The proposed model provides a viable mechanism for temporal credit assignment in reinforcement learning.
Supporting Evidence
- The model shows that learning speeds up with increasing population size.
- Simulations demonstrate that learning can occur even with significant delays in reward delivery.
- The proposed plasticity rule effectively addresses both spatial and temporal credit assignment problems.
Takeaway
This study shows how brain cells can learn from their mistakes even when rewards come late, by using a special memory system.
Methodology
The study uses simulations of a model with leaky integrate-and-fire neurons to explore reinforcement learning mechanisms.
Limitations
The model may not fully capture the complexities of real biological systems and relies on certain assumptions about synaptic behavior.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website