A distributional code for value in dopaminebased reinforcement learning

Summary

A distributional code for value in dopaminebased reinforcement learning

21 May 2021 - Yubin Wu

This paper shows that dopamine signal in ventral tagmental area is not uniform ,but rather every dopamine neuron learns a different value for RPE (reward prediction error) calculation.these values together constitutes the reward distribution,which can be much more useful than learning just a single value.

Original author： Yubin Wu
Link： https://CNeuroUSTC.github.io/2021/05/21/YubinWu.html
Copyright Notice： All articles on this WenLab site are subject to a BY-NC-SA license agreement, except for special notices. Reprint please indicate the source!