Defining admissible rewards for high-confidence policy evaluation in batch reinforcement learning

Niranjani Prasad, Barbara Engelhardt, Finale Doshi-Velez

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation
