Distributional Cloning for Stabilized Imitation Learning via ADMM

Xin Zhang, Yanhua Li, Ziming Zhang, Christopher G. Brinton, Zhenming Liu, Zhi Li Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The two leading solution paradigms for imitation learning (IL), BC and GAIL, each suffers from notable drawbacks. BC, a supervised learning approach to mimic expert actions, is vulnerable to covariate shift. GAIL applies adversarial training to minimize the discrepancy between expert and learner behaviors, which is prone to unstable training and mode collapse. In this work, we propose DC - Distributional Cloning - a novel IL approach for addressing the covariate shift and mode collapse problems simultaneously. DC directly maximizes the likelihood of observed expert and learner demonstrations, and gradually encourages the learner to evolve towards expert behaviors based on an averaging effect. The DC solution framework contains two stages in each training loop, where in stage one the mixed expert and learner state distribution is estimated via SoftFlow, and in stage two the learner policy is trained to match both the expert's policy and state distribution via ADMM. Experimental evaluation of DC compared with several baselines in 10 different physics-based control tasks reveal superior results in learner policy performance, training stability, and mode distribution preservation.

Original languageEnglish (US)
Title of host publicationProceedings - 23rd IEEE International Conference on Data Mining, ICDM 2023
EditorsGuihai Chen, Latifur Khan, Xiaofeng Gao, Meikang Qiu, Witold Pedrycz, Xindong Wu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages818-827
Number of pages10
ISBN (Electronic)9798350307887
DOIs
StatePublished - 2023
Externally publishedYes
Event23rd IEEE International Conference on Data Mining, ICDM 2023 - Shanghai, China
Duration: Dec 1 2023Dec 4 2023

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference23rd IEEE International Conference on Data Mining, ICDM 2023
Country/TerritoryChina
CityShanghai
Period12/1/2312/4/23

All Science Journal Classification (ASJC) codes

  • General Engineering

Keywords

  • imitation learning
  • neural ordinary differential equations

Fingerprint

Dive into the research topics of 'Distributional Cloning for Stabilized Imitation Learning via ADMM'. Together they form a unique fingerprint.

Cite this