Timezone: »
Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g., when function approximation is involved). Interestingly, there is growing evidence that actor-critic approaches based on phasic dopamine signals play a key role in biological learning through the cortical and basal ganglia. We derive a temporal difference based actor critic learning algorithm, for which convergence can be proved without assuming separate time scales for the actor and the critic. The approach is demonstrated by applying it to networks of spiking neurons. The established relation between phasic dopamine and the temporal difference signal lends support to the biological relevance of such algorithms.
Author Information
Dotan Di Castro (Technion, Israel Institute of Technology)
Dima Volkinshtein
Ron Meir (Technion)
More from the Same Authors
-
2022 Poster: Integral Probability Metrics PAC-Bayes Bounds »
Ron Amit · Baruch Epstein · Shay Moran · Ron Meir -
2021 Poster: A Theory of the Distortion-Perception Tradeoff in Wasserstein Space »
Dror Freirich · Tomer Michaeli · Ron Meir -
2015 Poster: A Tractable Approximation to Optimal Point Process Filtering: Application to Neural Encoding »
Yuval Harel · Ron Meir · Manfred Opper -
2015 Spotlight: A Tractable Approximation to Optimal Point Process Filtering: Application to Neural Encoding »
Yuval Harel · Ron Meir · Manfred Opper -
2014 Poster: Optimal Neural Codes for Control and Estimation »
Alex K Susemihl · Ron Meir · Manfred Opper -
2014 Poster: Expectation Backpropagation: Parameter-Free Training of Multilayer Neural Networks with Continuous or Discrete Weights »
Daniel Soudry · Itay Hubara · Ron Meir -
2011 Poster: Analytical Results for the Error in Filtering of Gaussian Processes »
Alex K Susemihl · Ron Meir · Manfred Opper -
2007 Oral: A neural network implementing optimal state estimation based on dynamic spike train decoding »
Omer Bobrowski · Ron Meir · Shy Shoham · Yonina Eldar -
2007 Poster: A neural network implementing optimal state estimation based on dynamic spike train decoding »
Omer Bobrowski · Ron Meir · Shy Shoham · Yonina Eldar