Reinforcement learning in feature space: Matrix bandit, kernels, and regret bound

Lin F. Yang, Mengdi Wang

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

4 Scopus citations

Fingerprint

Mathematics

Engineering & Materials Science