Defining admissible rewards for high-confidence policy evaluation in batch reinforcement learning
- Niranjani Prasad
- , Barbara Engelhardt
- , Finale Doshi-Velez
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
1
Link opens in a new tab
Scopus
citations