Fingerprint
Dive into the research topics of 'Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes'. Together they form a unique fingerprint.- Sort by
- Weight
- Alphabetically
Alekh Agarwal, Sham M. Kakade, Jason D. Lee, Gaurav Mahajan
Research output: Contribution to journal › Conference article › peer-review