Reinforcement learning in feature space: Matrix bandit, kernels, and regret bound

Lin F. Yang, Mengdi Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

39 Scopus citations

Fingerprint

Dive into the research topics of 'Reinforcement learning in feature space: Matrix bandit, kernels, and regret bound'. Together they form a unique fingerprint.

Computer Science

Keyphrases