Extending rCUDA with Support for P2P Memory Copies between Remote GPUs

Carlos Reano, Federico Silla

Research output: Chapter in Book/Report/Conference proceedingChapter

2 Citations (Scopus)

Abstract

Although GPUs are being widely adopted in order to noticeably reduce the execution time of many applications, their use presents several side effects such as an increased acquisition cost of the cluster nodes or an increased overall energy consumption. To address these concerns, GPU virtualization frameworks could be used. These frameworks allow accelerated applications to transparently use GPUs located in cluster nodes other than the one executing the program. Furthermore, these frameworks aim to offer the same API as the NVIDIA CUDA Runtime API does, although different frameworks provide different degree of support. In general, and because of the complexity of implementing an efficient mechanism, none of the existing frameworks provides support for memory copies between remote GPUs located in different nodes. In this paper we introduce an efficient mechanism devised for addressing the support for this kind of memory copies among GPUs located in different cluster nodes. Several options are explored and analyzed, such as the use of the GPUDirect RDMA mechanism. We focus our discussion on the rCUDA remote GPU virtualization framework. Results show that is is possible to implement this kind of memory copies in such an efficient way that performance is not reduced with respect to the original performance attained by CUDA when GPUs located in the same cluster node are leveraged. © 2016 IEEE.
Original languageEnglish
Title of host publicationProceedings - 18th IEEE International Conference on High Performance Computing and Communications, 14th IEEE International Conference on Smart City and 2nd IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages789-796
Number of pages8
ISBN (Print)9781509042968
DOIs
Publication statusPublished - 20 Jan 2017

Publication series

NameProceedings - 18th IEEE International Conference on High Performance Computing and Communications, 14th IEEE International Conference on Smart City and 2nd IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2016

Keywords

  • CUDA
  • GPUDirect RDMA
  • Virtualization

Fingerprint Dive into the research topics of 'Extending rCUDA with Support for P2P Memory Copies between Remote GPUs'. Together they form a unique fingerprint.

  • Cite this

    Reano, C., & Silla, F. (2017). Extending rCUDA with Support for P2P Memory Copies between Remote GPUs. In Proceedings - 18th IEEE International Conference on High Performance Computing and Communications, 14th IEEE International Conference on Smart City and 2nd IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2016 (pp. 789-796). (Proceedings - 18th IEEE International Conference on High Performance Computing and Communications, 14th IEEE International Conference on Smart City and 2nd IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2016). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/HPCC-SmartCity-DSS.2016.0114