Deep reinforcement learning based big data resource management for 5G/6G communications

  • Zhaoyuan Shi
  • , Xianzhong Xie
  • , Sahil Garg
  • , Huabing Lu
  • , Helin Yang
  • , Zehui Xiong

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

With the advent of the Internet of Everything era, communication data has exploded, which requires more communication resources, such as frequency, time, and energy. In this context, this paper presents a machine learning-based data packet scheduling scheme to achieve efficient data packet transmission in the 5G/6G communication systems. To minimize the average number of packet overflows (APNO), we propose distributed deep deterministic policy gradient (DDPG)-based algorithm for multidimensional resource scheduling. To improve the algorithm stability and training efficiency, the strategy of centralized training and distributed execution is adopted, and an Action Adjuster is designed. The proposed algorithm enables the multidimensional resource management of the 5G/6G commu-nication systems without any information interaction between each agent. Simulation results show that the proposed Action Adjuster DDPG algorithm achieves faster convergence and less data overflow compared to other benchmark algorithms.

Original languageEnglish
Title of host publication2021 IEEE Global Communications Conference (GLOBECOM): proceedings
PublisherIEEE
Number of pages6
ISBN (Electronic)9781728181042
ISBN (Print)9781728181059
DOIs
Publication statusPublished - 02 Feb 2021
Externally publishedYes
Event2021 IEEE Global Communications Conference, GLOBECOM 2021 - Madrid, Spain
Duration: 07 Dec 202111 Dec 2021

Publication series

NameProceedings - IEEE Global Communications Conference, GLOBECOM
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISSN (Print)2334-0983
ISSN (Electronic)2576-6813

Conference

Conference2021 IEEE Global Communications Conference, GLOBECOM 2021
Country/TerritorySpain
CityMadrid
Period07/12/202111/12/2021

Bibliographical note

Publisher Copyright:
© 2021 IEEE.

Keywords

  • Action Adjuster(AA)
  • deep deterministic policy gradient(DDPG)
  • multidimensional resource management

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Signal Processing

Fingerprint

Dive into the research topics of 'Deep reinforcement learning based big data resource management for 5G/6G communications'. Together they form a unique fingerprint.

Cite this