Boosting CUDA applications with CPU-GPU hybrid computing

Changmin Lee, Won Woo Ro, Jean Luc Gaudiot

Research output: Contribution to journalArticlepeer-review

26 Citations (Scopus)


This paper presents a cooperative heterogeneous computing framework which enables the efficient utilization of available computing resources of host CPU cores for CUDA kernels, which are designed to run only on GPU. The proposed system exploits at runtime the coarse-grain thread-level parallelism across CPU and GPU, without any source recompilation. To this end, three features including a work distribution module, a transparent memory space, and a global scheduling queue are described in this paper. With a completely automatic runtime workload distribution, the proposed framework achieves speedups of 3.08× in the best case and 1.42× on average compared to the baseline GPU-only processing.

Original languageEnglish
Pages (from-to)384-404
Number of pages21
JournalInternational Journal of Parallel Programming
Issue number2
Publication statusPublished - 2014 Apr

Bibliographical note

Funding Information:
Acknowledgments We thank all of the anonymous reviewers for their comments. This work was supported by the Basic Science Research Program through the National Research Foundation (NRF) of Korea, which is funded by the Ministry of Education, Science and Technology [2009-0070364]. This work is also supported in part by the US National Science Foundation under Grant No. CCF-1065448. Any opinions, findings, and conclusions as well as recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • Information Systems


Dive into the research topics of 'Boosting CUDA applications with CPU-GPU hybrid computing'. Together they form a unique fingerprint.

Cite this