NEAR-OPTIMAL OFFLINE REINFORCEMENT LEARNING WITH LINEAR REPRESENTATION: LEVERAGING VARIANCE INFORMATION WITH PESSIMISM

Ming Yin, Yaqi Duan, Mengdi Wang, Yu Xiang Wang

Research output: Contribution to conferencePaperpeer-review

26 Scopus citations

Fingerprint

Dive into the research topics of 'NEAR-OPTIMAL OFFLINE REINFORCEMENT LEARNING WITH LINEAR REPRESENTATION: LEVERAGING VARIANCE INFORMATION WITH PESSIMISM'. Together they form a unique fingerprint.

Computer Science

Keyphrases

Mathematics