TY - GEN
T1 - A Regret Minimization Approach to Iterative Learning Control
AU - Agarwal, Naman
AU - Hazan, Elad
AU - Majumdar, Anirudha
AU - Singh, Karan
N1 - Publisher Copyright:
Copyright © 2021 by the author(s)
PY - 2021
Y1 - 2021
N2 - We consider the setting of iterative learning control, or model-based policy learning in the presence of uncertain, time-varying dynamics. In this setting, we propose a new performance metric, planning regret, which replaces the standard stochastic uncertainty assumptions with worst case regret. Based on recent advances in nonstochastic control, we design a new iterative algorithm for minimizing planning regret that is more robust to model mismatch and uncertainty. We provide theoretical and empirical evidence that the proposed algorithm outperforms existing methods on several benchmarks.
AB - We consider the setting of iterative learning control, or model-based policy learning in the presence of uncertain, time-varying dynamics. In this setting, we propose a new performance metric, planning regret, which replaces the standard stochastic uncertainty assumptions with worst case regret. Based on recent advances in nonstochastic control, we design a new iterative algorithm for minimizing planning regret that is more robust to model mismatch and uncertainty. We provide theoretical and empirical evidence that the proposed algorithm outperforms existing methods on several benchmarks.
UR - http://www.scopus.com/inward/record.url?scp=85119879055&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119879055&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85119879055
T3 - Proceedings of Machine Learning Research
SP - 100
EP - 109
BT - Proceedings of the 38th International Conference on Machine Learning, ICML 2021
PB - ML Research Press
T2 - 38th International Conference on Machine Learning, ICML 2021
Y2 - 18 July 2021 through 24 July 2021
ER -