TY - GEN
T1 - AoI Minimization for UAV-to-Device Underlay Communication by Multi-agent Deep Reinforcement Learning
AU - Wu, Fanyi
AU - Zhang, Hongliang
AU - Wu, Jianjun
AU - Song, Lingyang
AU - Han, Zhu
AU - Poor, H. Vincent
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/12
Y1 - 2020/12
N2 - In this paper, we consider a cellular Internet of UAVs, where the sensory data can be transmitted either to the base station via cellular links, or to the mobile devices by underlay UAV-to-Device communications. To evaluate the freshness of the sensory data, the age of information (AoI) is adopted, in which a lower AoI implies fresher data. Since UAVs' AoIs are determined by their trajectories during sensing and transmission, we aim to minimize the AoIs of UAVs by designing their trajectories. This problem is a Markov decision problem with an infinite state-action space, and thus, we propose a multi-UAV trajectory design algorithm by leveraging multi-agent deep reinforcement learning to solve it. Simulation results show that our proposed algorithm outperforms both a greedy algorithm and a policy gradient algorithm.
AB - In this paper, we consider a cellular Internet of UAVs, where the sensory data can be transmitted either to the base station via cellular links, or to the mobile devices by underlay UAV-to-Device communications. To evaluate the freshness of the sensory data, the age of information (AoI) is adopted, in which a lower AoI implies fresher data. Since UAVs' AoIs are determined by their trajectories during sensing and transmission, we aim to minimize the AoIs of UAVs by designing their trajectories. This problem is a Markov decision problem with an infinite state-action space, and thus, we propose a multi-UAV trajectory design algorithm by leveraging multi-agent deep reinforcement learning to solve it. Simulation results show that our proposed algorithm outperforms both a greedy algorithm and a policy gradient algorithm.
KW - UAV-to-Device communication
KW - cellular Internet of UAVs
KW - multi-agent deep reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85100394064&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100394064&partnerID=8YFLogxK
U2 - 10.1109/GLOBECOM42002.2020.9322539
DO - 10.1109/GLOBECOM42002.2020.9322539
M3 - Conference contribution
AN - SCOPUS:85100394064
T3 - 2020 IEEE Global Communications Conference, GLOBECOM 2020 - Proceedings
BT - 2020 IEEE Global Communications Conference, GLOBECOM 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE Global Communications Conference, GLOBECOM 2020
Y2 - 7 December 2020 through 11 December 2020
ER -