Abstract
The mesostriatal dopamine system is prominently implicated in model-free reinforcement learning, with fMRI BOLD signals in ventral striatum notably covarying with model-free prediction errors. However, latent learning and devaluation studies show that behavior also shows hallmarks of model-based planning, and the interaction between model-based and model-free values, prediction errors, and preferences is underexplored. We designed a multistep decision task in which model-based and model-free influences on human choice behavior could be distinguished. By showing that choices reflected both influences we could then test the purity of the ventral striatal BOLD signal as a model-free report. Contrary to expectations, the signal reflected both model-free and model-based predictions in proportions matching those that best explained choice behavior. These results challenge the notion of a separate model-free learner and suggest a more integrated computational architecture for high-level human decision-making.
Original language | English (US) |
---|---|
Pages (from-to) | 1204-1215 |
Number of pages | 12 |
Journal | Neuron |
Volume | 69 |
Issue number | 6 |
DOIs | |
State | Published - Mar 24 2011 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- General Neuroscience