Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL
- Qinghua Liu
- Chi Jin
- Gellért Weisz
- András György
- Csaba Szepesvári
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
6 Scopus citations