DASA: Delay-Adaptive Multi-Agent Stochastic Approximation

Nicolo Dal Fabbro, Arman Adibi, H. Vincent Poor, Sanjeev R. Kulkarni, Aritra Mitra, George J. Pappas

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We consider a setting in which N agents aim to speedup a common Stochastic Approximation (SA) problem by acting in parallel and communicating with a central server. We assume that the up-link transmissions to the server are subject to asynchronous and potentially unbounded time-varying delays. To mitigate the effect of delays and stragglers while reaping the benefits of distributed computation, we propose DASA, a Delay-Adaptive algorithm for multi-agent Stochastic Approximation. We provide a finite-time analysis of DASA assuming that the agents' stochastic observation processes are independent Markov chains. Significantly advancing existing results, DASA is the first algorithm whose convergence rate depends only on the mixing time τm i x and on the average delay τa v g while jointly achieving an N-fold convergence speedup under Markovian sampling. Our work is relevant for various SA applications, including multi-agent and distributed temporal difference (TD) learning, Q-learning and stochastic optimization with correlated data.

Original languageEnglish (US)
Title of host publication2024 IEEE 63rd Conference on Decision and Control, CDC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3889-3896
Number of pages8
ISBN (Electronic)9798350316339
DOIs
StatePublished - 2024
Externally publishedYes
Event63rd IEEE Conference on Decision and Control, CDC 2024 - Milan, Italy
Duration: Dec 16 2024Dec 19 2024

Publication series

NameProceedings of the IEEE Conference on Decision and Control
ISSN (Print)0743-1546
ISSN (Electronic)2576-2370

Conference

Conference63rd IEEE Conference on Decision and Control, CDC 2024
Country/TerritoryItaly
CityMilan
Period12/16/2412/19/24

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Modeling and Simulation
  • Control and Optimization

Fingerprint

Dive into the research topics of 'DASA: Delay-Adaptive Multi-Agent Stochastic Approximation'. Together they form a unique fingerprint.

Cite this