SnuRHAC: A Runtime for Heterogeneous Accelerator Clusters with CUDA Unified Memory

Jaehoon Jung, Daeyoung Park, Gangwon Jo, Jungho Park, Jaejin Lee

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

This paper proposes a framework called SnuRHAC, which provides an illusion of a single GPU for the multiple GPUs in a cluster. Under SnuRHAC, a CUDA program designed to use a single GPU can utilize multiple GPUs in a cluster without any source code modification. SnuRHAC automatically distributes workload to multiple GPUs in a cluster and manages data across the nodes. To manage data efficiently, SnuRHAC extends CUDA Unified Memory and exploits its page fault mechanism. We also propose two prefetching techniques to fully exploit UM and to maximize performance. Static prefetching allows SnuRHAC to prefetch data by statically analyzing CUDA kernels. Dynamic prefetching complements static prefetching. SnuRHAC enforces an application to run on a single GPU if it is not suitable for multiple GPUs. We evaluate the performance of SnuRHAC using 18 benchmark applications from various sources. The evaluation result shows that while SnuRHAC significantly improves ease-of-programming, it shows scalable performance for the cluster environment depending on the application characteristics.

Original languageEnglish
Title of host publicationHPDC 2021 - Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing
PublisherAssociation for Computing Machinery, Inc
Pages107-120
Number of pages14
ISBN (Electronic)9781450382175
DOIs
StatePublished - 21 Jun 2021
Externally publishedYes
Event30th International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2021 - Virtual, Online, Sweden
Duration: 21 Jun 202125 Jun 2021

Publication series

NameHPDC 2021 - Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing

Conference

Conference30th International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2021
Country/TerritorySweden
CityVirtual, Online
Period21/06/2125/06/21

Bibliographical note

Publisher Copyright:
© 2020 Owner/Author.

Keywords

  • cuda
  • device driver
  • gpu
  • heterogeneous computing
  • runtime system
  • single device image
  • unified memory

Fingerprint

Dive into the research topics of 'SnuRHAC: A Runtime for Heterogeneous Accelerator Clusters with CUDA Unified Memory'. Together they form a unique fingerprint.

Cite this