Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

  • Alekh Agarwal
  • , Sham M. Kakade
  • , Jason D. Lee
  • , Gaurav Mahajan

Research output: Contribution to journalConference articlepeer-review

184 Scopus citations

Fingerprint

Dive into the research topics of 'Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes'. Together they form a unique fingerprint.

Mathematics

Keyphrases