PROVABLE OFFLINE PREFERENCE-BASED REINFORCEMENT LEARNING

Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun

Research output: Contribution to conference › Paper › peer-review
