Reward prediction error as an exploration objective in deep RL
- Riley Simmons-Edler
- , Ben Eisner
- , Daniel Yang
- , Anthony Bisulco
- , Eric Mitchell
- , Sebastian Seung
- , Daniel Lee
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
3
Link opens in a new tab
Scopus
citations