TY - JOUR
T1 - Intelligent User Association for Symbiotic Radio Networks Using Deep Reinforcement Learning
AU - Zhang, Qianqian
AU - Liang, Ying Chang
AU - Poor, H. Vincent
N1 - Funding Information:
Manuscript received May 6, 2019; revised October 15, 2019 and February 10, 2020; accepted March 25, 2020. Date of publication April 7, 2020; date of current version July 10, 2020. This work was supported in part by the National Natural Science Foundation of China under Grant 61631005 and Grant U1801261, in part by the National Key Research and Development Program of China under Grant 2018YFB1801105, in part by the Central Universities under Grant ZYGX2019Z022, in part by the 111 Project under Grant B20064, and in part by the U.S. National Science Foundation under Grant CCF-0939370, Grant CCF-1513915, and Grant CCF-1908308. The associate editor coordinating the review of this article and approving it for publication was X. Cheng. (Corresponding author: Ying-Chang Liang.) Qianqian Zhang is with the National Key Laboratory of Science and Technology on Communications, University of Electronic Science and Technology of China (UESTC), Chengdu 611731, China, and also with the Center for Intelligent Networking and Communications (CINC), University of Electronic Science and Technology of China (UESTC), Chengdu 611731, China (e-mail: qqzhang_kite@163.com).
Publisher Copyright:
© 2002-2012 IEEE.
PY - 2020/7
Y1 - 2020/7
N2 - In this paper, we are interested in symbiotic radio networks (SRNs), in which an Internet-of-Things (IoT) network parasitizes in a primary cellular network to achieve spectrum-, energy-, and infrastructure-efficient communications. Each IoT device transmits its own information by backscattering the signals from the primary network without using active radio-frequency (RF) transmitter chain. We consider the symbiosis between the cellular network and the IoT network and focus on the user association problem in SRN. Specifically, the base station (BS) in the primary network serves multiple cellular users using time division multiple access (TDMA) and each IoT device is associated with one cellular user for information transmission. The objective of user association is to link each IoT device to an appropriate cellular user by maximizing the sum rate of all IoT devices. However, the difficulty in obtaining the full real-time channel information makes it difficult to design an optimal policy for this problem. To overcome this issue, we propose two deep reinforcement learning (DRL) algorithms, both use historical information to infer the current information in order to make appropriate decisions. One algorithm, referred to as centralized DRL, makes decisions for all IoT devices at one time with globally available information. The other algorithm, referred to as distributed DRL, makes a decision only for one IoT device at one time using locally available information. Finally, simulation results show that the two proposed DRL algorithms achieve performance comparable to the optimal user association policy which requires perfect real-time information, and the distributed DRL algorithm has the advantage of scalability.
AB - In this paper, we are interested in symbiotic radio networks (SRNs), in which an Internet-of-Things (IoT) network parasitizes in a primary cellular network to achieve spectrum-, energy-, and infrastructure-efficient communications. Each IoT device transmits its own information by backscattering the signals from the primary network without using active radio-frequency (RF) transmitter chain. We consider the symbiosis between the cellular network and the IoT network and focus on the user association problem in SRN. Specifically, the base station (BS) in the primary network serves multiple cellular users using time division multiple access (TDMA) and each IoT device is associated with one cellular user for information transmission. The objective of user association is to link each IoT device to an appropriate cellular user by maximizing the sum rate of all IoT devices. However, the difficulty in obtaining the full real-time channel information makes it difficult to design an optimal policy for this problem. To overcome this issue, we propose two deep reinforcement learning (DRL) algorithms, both use historical information to infer the current information in order to make appropriate decisions. One algorithm, referred to as centralized DRL, makes decisions for all IoT devices at one time with globally available information. The other algorithm, referred to as distributed DRL, makes a decision only for one IoT device at one time using locally available information. Finally, simulation results show that the two proposed DRL algorithms achieve performance comparable to the optimal user association policy which requires perfect real-time information, and the distributed DRL algorithm has the advantage of scalability.
KW - Symbiotic radio network (SRN)
KW - ambient backscatter communication (AmBC)
KW - deep rein-forcement learning
KW - user association
UR - http://www.scopus.com/inward/record.url?scp=85089303136&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089303136&partnerID=8YFLogxK
U2 - 10.1109/TWC.2020.2984758
DO - 10.1109/TWC.2020.2984758
M3 - Article
AN - SCOPUS:85089303136
SN - 1536-1276
VL - 19
SP - 4535
EP - 4548
JO - IEEE Transactions on Wireless Communications
JF - IEEE Transactions on Wireless Communications
IS - 7
M1 - 9058982
ER -