Agnostic Q-learning with function approximation in deterministic systems: Near-optimal bounds on approximation error and sample complexity

Simon S. Du, Jason D. Lee, Gaurav Mahajan, Ruosong Wang

Research output: Contribution to journal › Conference article › peer-review

8 Scopus citations

Fingerprint

Engineering & Materials Science