TY - JOUR
T1 - A Machine Learning Approach for Task and Resource Allocation in Mobile-Edge Computing-Based Networks
AU - Wang, Sihua
AU - Chen, Mingzhe
AU - Liu, Xuanlin
AU - Yin, Changchuan
AU - Cui, Shuguang
AU - Vincent Poor, H.
N1 - Funding Information:
Manuscript received February 10, 2020; revised July 9, 2020; accepted July 17, 2020. Date of publication July 22, 2020; date of current version January 22, 2021. This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1800802, in part by the Natural Science Foundation of China under Grant 61629101 and Grant 61671086, in part by the Key Area Research and Development Program of Guangdong Province under Grant 2018B030338001, in part by the Guangdong Research Project under Grant 2017ZT07X152, in part by the Beijing Natural Science Foundation under Grant KZ201911232046, in part by the 111 Project under Grant B17007, and in part by the BUPT Excellent Ph.D. Students Foundation under Grant CX2020307. (Corresponding author: Changchuan Yin.) Sihua Wang, Xuanlin Liu, and Changchuan Yin are with the Beijing Laboratory of Advanced Information Network and the Beijing Key Laboratory of Network System Architecture and Convergence, Beijing University of Posts and Telecommunications, Beijing 100876, China (e-mail: sihuawang@bupt.edu.cn; xuanlin.liu@bupt.edu.cn; ccyin@ieee.org).
Publisher Copyright:
© 2014 IEEE.
PY - 2021/2/1
Y1 - 2021/2/1
N2 - In this article, a joint task, spectrum, and transmit power allocation problem is investigated for a wireless network in which the base stations (BSs) are equipped with mobile-edge computing (MEC) servers to jointly provide computational and communication services to users. Each user can request one computational task from three types of computational tasks. Since the data size of each computational task is different, as the requested computational task varies, the BSs must adjust their resource (subcarrier and transmit power) and task allocation schemes to effectively serve the users. This problem is formulated as an optimization problem whose goal is to minimize the maximal computational and transmission delay among all users. A multistack reinforcement learning (RL) algorithm is developed to solve this problem. Using the proposed algorithm, each BS can record the historical resource allocation schemes and users' information in its multiple stacks to avoid learning the same resource allocation scheme and users' states, thus improving the convergence speed and learning efficiency. The simulation results illustrate that the proposed algorithm can reduce the number of iterations needed for convergence and the maximal delay among all users by up to 18% and 11.1% compared to the standard Q -learning algorithm.
AB - In this article, a joint task, spectrum, and transmit power allocation problem is investigated for a wireless network in which the base stations (BSs) are equipped with mobile-edge computing (MEC) servers to jointly provide computational and communication services to users. Each user can request one computational task from three types of computational tasks. Since the data size of each computational task is different, as the requested computational task varies, the BSs must adjust their resource (subcarrier and transmit power) and task allocation schemes to effectively serve the users. This problem is formulated as an optimization problem whose goal is to minimize the maximal computational and transmission delay among all users. A multistack reinforcement learning (RL) algorithm is developed to solve this problem. Using the proposed algorithm, each BS can record the historical resource allocation schemes and users' information in its multiple stacks to avoid learning the same resource allocation scheme and users' states, thus improving the convergence speed and learning efficiency. The simulation results illustrate that the proposed algorithm can reduce the number of iterations needed for convergence and the maximal delay among all users by up to 18% and 11.1% compared to the standard Q -learning algorithm.
KW - Mobile-edge computing (MEC)
KW - multistack reinforcementlearning (RL)
KW - resource management
UR - http://www.scopus.com/inward/record.url?scp=85099268105&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85099268105&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2020.3011286
DO - 10.1109/JIOT.2020.3011286
M3 - Article
AN - SCOPUS:85099268105
SN - 2327-4662
VL - 8
SP - 1358
EP - 1372
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 3
M1 - 9146372
ER -