Reward prediction error as an exploration objective in deep RL

Riley Simmons-Edler, Ben Eisner, Daniel Yang, Anthony Bisulco, Eric Mitchell, Sebastian Seung, Daniel Lee

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations
