PROVABLE REWARD-AGNOSTIC PREFERENCE-BASED REINFORCEMENT LEARNING

Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee

Research output: Contribution to conference › Paper › peer-review
