Beyond Cumulative Returns via Reinforcement Learning over State-Action Occupancy Measures

Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Fingerprint

Dive into the research topics of 'Beyond Cumulative Returns via Reinforcement Learning over State-Action Occupancy Measures'. Together they form a unique fingerprint.

Keyphrases

Mathematics

Computer Science