TY - GEN
T1 - A Reinforcement Learning-Based Approach to Graph Discovery in D2D-Enabled Federated Learning
AU - Wagle, Satyavrat
AU - Das, Anindya Bijoy
AU - Love, David J.
AU - Brinton, Christopher G.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Augmenting federated learning (FL) with direct device-to-device (D2D) communications can help improve conver-gence speed and reduce model bias through rapid local information exchange. However, data privacy concerns, device trust issues, and unreliable wireless channels each pose challenges to determining an effective yet resource efficient D2D structure. In this paper, we develop a decentralized reinforcement learning (RL) methodology for D2D graph discovery that promotes communication of non-sensitive yet impactful data-points over trusted yet reliable links. Each device functions as an RL agent, training a policy to predict the impact of incoming links. Local (device-level) and global rewards are coupled through message passing within and between device clusters. Numerical experiments confirm the advantages offered by our method in terms of convergence speed and straggler resilience across several datasets and FL schemes.
AB - Augmenting federated learning (FL) with direct device-to-device (D2D) communications can help improve conver-gence speed and reduce model bias through rapid local information exchange. However, data privacy concerns, device trust issues, and unreliable wireless channels each pose challenges to determining an effective yet resource efficient D2D structure. In this paper, we develop a decentralized reinforcement learning (RL) methodology for D2D graph discovery that promotes communication of non-sensitive yet impactful data-points over trusted yet reliable links. Each device functions as an RL agent, training a policy to predict the impact of incoming links. Local (device-level) and global rewards are coupled through message passing within and between device clusters. Numerical experiments confirm the advantages offered by our method in terms of convergence speed and straggler resilience across several datasets and FL schemes.
UR - http://www.scopus.com/inward/record.url?scp=85185965122&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85185965122&partnerID=8YFLogxK
U2 - 10.1109/GLOBECOM54140.2023.10437633
DO - 10.1109/GLOBECOM54140.2023.10437633
M3 - Conference contribution
AN - SCOPUS:85185965122
T3 - Proceedings - IEEE Global Communications Conference, GLOBECOM
SP - 225
EP - 230
BT - GLOBECOM 2023 - 2023 IEEE Global Communications Conference
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE Global Communications Conference, GLOBECOM 2023
Y2 - 4 December 2023 through 8 December 2023
ER -