Fingerprint
Dive into the research topics of 'On the theory of policy gradient methods: Optimality, approximation, and distribution shift'. Together they form a unique fingerprint.- Sort by
- Weight
- Alphabetically
Alekh Agarwal, Sham M. Kakade, Jason D. Lee, Gaurav Mahajan
Research output: Contribution to journal › Article › peer-review