A comprehensive survey on coded distributed computing: fundamentals, challenges, and networking applications

Jer Shyuan Ng, Wei Yang Bryan Lim, Nguyen Cong Luong*, Zehui Xiong, Alia Asheralieva, Dusit Niyato, Cyril Leung, Chunyan Miao

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

102 Citations (Scopus)

Abstract

Distributed computing has become a common approach for large-scale computation tasks due to benefits such as high reliability, scalability, computation speed, and cost-effectiveness. However, distributed computing faces critical issues related to communication load and straggler effects. In particular, computing nodes need to exchange intermediate results with each other in order to calculate the final result, and this significantly increases communication overheads. Furthermore, a distributed computing network may include straggling nodes that run intermittently slower. This results in a longer overall time needed to execute the computation tasks, thereby limiting the performance of distributed computing. To address these issues, coded distributed computing (CDC), i.e., a combination of coding theoretic techniques and distributed computing, has been recently proposed as a promising solution. Coding theoretic techniques have proved effective in WiFi and cellular systems to deal with channel noise. Therefore, CDC may significantly reduce communication load, alleviate the effects of stragglers, provide fault-tolerance, privacy and security. In this survey, we first introduce the fundamentals of CDC, followed by basic CDC schemes. Then, we review and analyze a number of CDC approaches proposed to reduce the communication costs, mitigate the straggler effects, and guarantee privacy and security. Furthermore, we present and discuss applications of CDC in modern computer networks. Finally, we highlight important challenges and promising research directions related to CDC.

Original languageEnglish
Article number9463425
Pages (from-to)1800-1837
Number of pages38
JournalIEEE Communications Surveys and Tutorials
Volume23
Issue number3
DOIs
Publication statusPublished - 23 Jul 2021
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2021 IEEE.

Keywords

  • Coded distributed computing
  • Communication minimization
  • Distributed computing
  • Security
  • Straggler effects mitigation

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'A comprehensive survey on coded distributed computing: fundamentals, challenges, and networking applications'. Together they form a unique fingerprint.

Cite this