Provable Representation Learning for Imitation Learning via Bi-level Optimization

  • Sanjeev Arora
  • , Simon S. Du
  • , Sham Kakade
  • , Yuping Luo
  • , Nikunj Saunshi

Research output: Contribution to journalConference articlepeer-review

11 Scopus citations

Abstract

A common strategy in modern learning systems is to learn a representation that is useful for many tasks, a.k.a. representation learning. We study this strategy in the imitation learning setting for Markov decision processes (MDPs) where multiple experts’ trajectories are available. We formulate representation learning as a bi-level optimization problem where the “outer” optimization tries to learn the joint representation and the “inner” optimization encodes the imitation learning setup and tries to learn task-specific parameters. We instantiate this framework for the imitation learning settings of behavior cloning and observation-alone. Theoretically, we show using our framework that representation learning can provide sample complexity benefits for imitation learning in both settings. We also provide proof-of-concept experiments to verify our theory.

Original languageEnglish (US)
Pages (from-to)367-376
Number of pages10
JournalProceedings of Machine Learning Research
Volume119
StatePublished - 2020
Event37th International Conference on Machine Learning, ICML 2020 - Virtual, Online
Duration: Jul 13 2020Jul 18 2020

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Provable Representation Learning for Imitation Learning via Bi-level Optimization'. Together they form a unique fingerprint.

Cite this