OFFLINE REINFORCEMENT LEARNING WITH DIFFERENTIABLE FUNCTION APPROXIMATION IS PROVABLY EFFICIENT

Ming Yin, Mengdi Wang, Yu Xiang Wang

Research output: Contribution to conference › Paper › peer-review
