Reducing GPU offload latency via fine-grained CPU-GPU synchronization

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

85 Scopus citations

Abstract

GPUs are seeing increasingly widespread use for general-purpose computation due to their excellent performance for highly parallel, throughput-oriented applications. For many workloads, however, the performance benefits of offloading are hindered by the large and unpredictable overheads of launching GPU kernels and of transferring data between CPU and GPU.

Original language: English (US)
Title of host publication: 19th IEEE International Symposium on High Performance Computer Architecture, HPCA 2013
Pages: 354-365
Number of pages: 12
DOIs
State: Published - 2013
Event: 19th IEEE International Symposium on High Performance Computer Architecture, HPCA 2013 - Shenzhen, China
Duration: Feb 23 2013 → Feb 27 2013

Publication series

Name: Proceedings - International Symposium on High-Performance Computer Architecture
ISSN (Print): 1530-0897

Other

Other: 19th IEEE International Symposium on High Performance Computer Architecture, HPCA 2013
Country/Territory: China
City: Shenzhen
Period: 2/23/13 → 2/27/13

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
