Cookies on this website
We use cookies to ensure that we give you the best experience on our website. If you click 'Continue' we'll assume that you are happy to receive all cookies and you won't see this message again. Click 'Find out more' for information on how to change your cookie settings.

A new study in reinforcement learning theory shows that extending the temporal difference algorithm to unbiased learning under state uncertainty explains the observed ramping behaviour of dopamine neurons.

Original publication

DOI

10.1016/j.cub.2022.01.070

Type

Journal article

Journal

Curr Biol

Publication Date

14/03/2022

Volume

32

Pages

R213 - R215