Abstract
A long-standing challenge for psychology and neuroscience is to understand the transformations by which past experiences shape future behaviour. Reward-guided learning is typically modelled using simple reinforcement learning (RL) algorithms. In RL, a handful of incrementally updated internal variables both summarize past rewards and drive future choice. Here we describe work that questions the assumptions of many RL models. We adopt a hybrid modelling approach that integrates artificial neural networks into interpretable cognitive architectures, estimating a maximally general form for each algorithmic component and systematically evaluating its necessity and sufficiency. Applying this method to a large dataset of human reward-learning behaviour, we show that successful models require independent and flexible memory variables that can track rich representations of the past. Using a modelling approach that combines predictive accuracy and interpretability, these results call into question an entire class of popular RL models based on incremental updating of scalar reward predictions.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 972-987 |
| Number of pages | 16 |
| Journal | Nature Human Behaviour |
| Volume | 10 |
| Issue number | 5 |
| DOIs | |
| State | Published - May 2026 |
All Science Journal Classification (ASJC) codes
- Social Psychology
- Experimental and Cognitive Psychology
- Behavioral Neuroscience
Fingerprint
Dive into the research topics of 'Hybrid neural–cognitive models reveal how memory shapes human reward learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver