TY - JOUR
T1 - Generating Adversarial Disturbances for Controller Verification
AU - Ghai, Udaya
AU - Snyder, David
AU - Majumdar, Anirudha
AU - Hazan, Elad
N1 - Funding Information:
We thank Karan Singh for enlightening discussion. Elad Hazan is partially supported by NSF award # 1704860 as well as the Google corporation. Anirudha Majumdar was partially supported by the Office of Naval Research [Award Number: N00014-18-1-2873]. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE-2039656. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
Publisher Copyright:
© 2021 U. Ghai, D. Snyder, A. Majumdar & E. Hazan.
PY - 2021
Y1 - 2021
N2 - We consider the problem of generating maximally adversarial disturbances for a given controller assuming only blackbox access to it. We propose an online learning approach to this problem that adaptively generates disturbances based on control inputs chosen by the controller. The goal of the disturbance generator is to minimize regret versus a benchmark disturbance-generating policy class, i.e., to maximize the cost incurred by the controller as well as possible compared to the best possible disturbance generator in hindsight (chosen from a benchmark policy class). In the setting where the dynamics are linear and the costs are quadratic, we formulate our problem as an online trust region (OTR) problem with memory and present a new online learning algorithm (MOTR) for this problem. We prove that this method competes with the best disturbance generator in hindsight (chosen from a rich class of benchmark policies that includes linear-dynamical disturbance generating policies). We demonstrate our approach on two simulated examples: (i) synthetically generated linear systems, and (ii) generating wind disturbances for the popular PX4 controller in the AirSim simulator. On these examples, we demonstrate that our approach outperforms several baseline approaches, including H∞ disturbance generation and gradient-based methods.
AB - We consider the problem of generating maximally adversarial disturbances for a given controller assuming only blackbox access to it. We propose an online learning approach to this problem that adaptively generates disturbances based on control inputs chosen by the controller. The goal of the disturbance generator is to minimize regret versus a benchmark disturbance-generating policy class, i.e., to maximize the cost incurred by the controller as well as possible compared to the best possible disturbance generator in hindsight (chosen from a benchmark policy class). In the setting where the dynamics are linear and the costs are quadratic, we formulate our problem as an online trust region (OTR) problem with memory and present a new online learning algorithm (MOTR) for this problem. We prove that this method competes with the best disturbance generator in hindsight (chosen from a rich class of benchmark policies that includes linear-dynamical disturbance generating policies). We demonstrate our approach on two simulated examples: (i) synthetically generated linear systems, and (ii) generating wind disturbances for the popular PX4 controller in the AirSim simulator. On these examples, we demonstrate that our approach outperforms several baseline approaches, including H∞ disturbance generation and gradient-based methods.
KW - Adversarial Disturbances
KW - Controller Verification
KW - Online Learning
UR - http://www.scopus.com/inward/record.url?scp=85124641153&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85124641153&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85124641153
SN - 2640-3498
VL - 144
SP - 1192
EP - 1204
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 3rd Annual Conference on Learning for Dynamics and Control, L4DC 2021
Y2 - 7 June 2021 through 8 June 2021
ER -