Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

Alekh Agarwal, Sham M. Kakade, Jason D. Lee, Gaurav Mahajan

Research output: Contribution to journalConference articlepeer-review

153 Scopus citations

Fingerprint

Dive into the research topics of 'Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes'. Together they form a unique fingerprint.

Mathematics

Keyphrases