Skip to main navigation Skip to search Skip to main content

Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

  • Alekh Agarwal
  • , Sham M. Kakade
  • , Jason D. Lee
  • , Gaurav Mahajan

Research output: Contribution to journalConference articlepeer-review

Fingerprint

Dive into the research topics of 'Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes'. Together they form a unique fingerprint.
Sort by

Keyphrases

Mathematics