Reinforcement learning: Dopamine ramps with fuzzy value estimates.

Whittington JCR., Behrens TEJ.

A new study in reinforcement learning theory shows that extending the temporal difference algorithm to unbiased learning under state uncertainty explains the observed ramping behaviour of dopamine neurons.

DOI

10.1016/j.cub.2022.01.070

Type

Journal article

Journal

Curr Biol

Publication Date

14/03/2022

Volume

32

Pages

R213 - R215

Permalink Original publication