TY - GEN
T1 - Dopamine and inference about timing
AU - Daw, N. D.
AU - Courville, A. C.
AU - Touretzky, D. S.
N1 - Publisher Copyright: © 2002 IEEE.
PY - 2002
Y1 - 2002
AB - Several investigators have suggested that the primate dopamine system carries an error signal for learning to predict future rewards. These models, based on temporal-difference (TD) learning, explain most phasic responses of primate dopamine neurons in appetitive conditioning; moreover, they suggest a neurophysiological account of animal conditioning behavior. But because existing models are based in the simple formal setting of Markov processes, they are deficient in at least two areas relevant to physiological and behavioral data. They do not provide a realistic account of the partial observability of the state of the world, nor of how the system tracks the timing of events. In this paper, we introduce a version of TD learning grounded in a richer formal model to better address both issues and, consequently, to explain some data that challenge existing models.
UR - http://www.scopus.com/inward/record.url?scp=0038503086&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0038503086&partnerID=8YFLogxK
U2 - 10.1109/DEVLRN.2002.1011901
DO - 10.1109/DEVLRN.2002.1011901
M3 - Conference contribution
AN - SCOPUS:0038503086
T3 - Proceedings - 2nd International Conference on Development and Learning, ICDL 2002
SP - 271
EP - 276
BT - Proceedings - 2nd International Conference on Development and Learning, ICDL 2002
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Development and Learning, ICDL 2002
Y2 - 12 June 2002 through 15 June 2002
ER -