Abstract

Increasingly high performance computing (HPC) application developers are opting to use cloud resources due to higher availability. Virtualized GPUs would be an obvious and attractive option for HPC application developers using cloud hosting services. Unfortunately, existing GPU virtualization software is not ready to address fairness, utilization, and performance limitations associated with consolidating mixed HPC workloads. This paper presents FairGV, a radically redesigned GPU virtualization system that achieves system-wide weighted fair sharing and strong performance isolation in mixed workloads that use GPUs with variable degrees of intensity. To achieve its objectives, FairGV introduces a trap-less GPU processing architecture, a new fair queuing method integrated with work-conserving and GPU-centric coscheduling polices, and a collaborative scheduling method for non-preemptive GPUs. Our prototype implementation achieves near ideal fairness (≥ 0.97 Min-Max Ratio) with little performance degradation (≤ 1.02 aggregated overhead) in a range of mixed HPC workloads that leverage GPUs.
Original languageEnglish
Pages (from-to)3472-3485
JournalIEEE Transactions on Parallel and Distributed Systems
Volume28
Issue number12
DOIs
Publication statusPublished - 01 Dec 2017

Keywords

  • GPU virtualization
  • trap-less architecture
  • fair queuing
  • coscheduling and hybrid scheduling strategies

Fingerprint Dive into the research topics of 'FairGV: Fair and Fast GPU Virtualization'. Together they form a unique fingerprint.

Cite this