TY - GEN
T1 - Reducing GPU offload latency via fine-grained CPU-GPU synchronization
AU - Lustig, Daniel
AU - Martonosi, Margaret Rose
PY - 2013
Y1 - 2013
N2 - GPUs are seeing increasingly widespread use for general purpose computation due to their excellent performance for highly-parallel, throughput-oriented applications. For many workloads, however, the performance benefits of offloading are hindered by the large and unpredictable overheads of launching GPU kernels and of transferring data between CPU and GPU.
AB - GPUs are seeing increasingly widespread use for general purpose computation due to their excellent performance for highly-parallel, throughput-oriented applications. For many workloads, however, the performance benefits of offloading are hindered by the large and unpredictable overheads of launching GPU kernels and of transferring data between CPU and GPU.
UR - http://www.scopus.com/inward/record.url?scp=84880254195&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84880254195&partnerID=8YFLogxK
U2 - 10.1109/HPCA.2013.6522332
DO - 10.1109/HPCA.2013.6522332
M3 - Conference contribution
AN - SCOPUS:84880254195
SN - 9781467355858
T3 - Proceedings - International Symposium on High-Performance Computer Architecture
SP - 354
EP - 365
BT - 19th IEEE International Symposium on High Performance Computer Architecture, HPCA 2013
T2 - 19th IEEE International Symposium on High Performance Computer Architecture, HPCA 2013
Y2 - 23 February 2013 through 27 February 2013
ER -