NEAR-OPTIMAL OFFLINE REINFORCEMENT LEARNING WITH LINEAR REPRESENTATION: LEVERAGING VARIANCE INFORMATION WITH PESSIMISM

Ming Yin, Yaqi Duan, Mengdi Wang, Yu Xiang Wang

Research output: Contribution to conference, Paper, peer-review

14 Scopus citations
