Dopamine, uncertainty and TD learning

Substantial evidence suggests that the phasic activities of dopaminergic neurons in the primate midbrain represent a temporal difference (TD) error in predictions of future reward, with increases above and decreases below baseline consequent on positive and negative prediction errors, respectively....

Full description

Saved in:

Bibliographic Details
Published in	Behavioral and brain functions Vol. 1; no. 1; p. 6
Main Authors	Niv, Yael, Duff, Michael O, Dayan, Peter
Format	Journal Article
Language	English
Published	England BioMed Central Ltd 04.05.2005 BioMed Central BMC
Subjects	Primates Short Paper
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Substantial evidence suggests that the phasic activities of dopaminergic neurons in the primate midbrain represent a temporal difference (TD) error in predictions of future reward, with increases above and decreases below baseline consequent on positive and negative prediction errors, respectively. However, dopamine cells have very low baseline activity, which implies that the representation of these two sorts of error is asymmetric. We explore the implications of this seemingly innocuous asymmetry for the interpretation of dopaminergic firing patterns in experiments with probabilistic rewards which bring about persistent prediction errors. In particular, we show that when averaging the non-stationary prediction errors across trials, a ramping in the activity of the dopamine neurons should be apparent, whose magnitude is dependent on the learning rate. This exact phenomenon was observed in a recent experiment, though being interpreted there in antipodal terms as a within-trial encoding of uncertainty.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1744-9081 1744-9081
DOI:	10.1186/1744-9081-1-6