Reward estimation for variance reduction in deep reinforcement learning

Joshua Romoff, Alexandre Piché, Peter Henderson, Vincent Francois-Lavet, Joelle Pineau

Research output: Contribution to conferencePaperpeer-review

10 Scopus citations

Fingerprint

Dive into the research topics of 'Reward estimation for variance reduction in deep reinforcement learning'. Together they form a unique fingerprint.

Mathematics

Engineering

Neuroscience

Keyphrases