POLICY MIRROR DESCENT FOR REGULARIZED REINFORCEMENT LEARNING: A GENERALIZED FRAMEWORK WITH LINEAR CONVERGENCE

Wenhao Zhan, Shicong Cen, Baihe Huang, Yuxin Chen, Jason D. Lee, Yuejie Chi

Research output: Contribution to journal › Article › peer-review

12 Scopus citations
