Abstract
GPUs, which are widely used high-performance hardware accelerators in heterogeneous computing, and programming models for architectures such as OpenCL and CUDA, have recently been developed to achieve high productivity. LLVM is an open-source compiler infrastructure that enables low-level optimization through LLVM intermediate representation (LLVM IR) in various programming language environments. In this paper, we propose a fully-automatic Dynamic Profiling framework which performs instruction-level analysis through IR-level code instrumentation for typical OpenCL workload kernels.
Original language | English |
---|---|
Title of host publication | Proceedings of TENCON 2018 - 2018 IEEE Region 10 Conference |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1520-1524 |
Number of pages | 5 |
ISBN (Electronic) | 9781538654576 |
DOIs | |
Publication status | Published - 2018 Jul 2 |
Event | 2018 IEEE Region 10 Conference, TENCON 2018 - Jeju, Korea, Republic of Duration: 2018 Oct 28 → 2018 Oct 31 |
Publication series
Name | IEEE Region 10 Annual International Conference, Proceedings/TENCON |
---|---|
Volume | 2018-October |
ISSN (Print) | 2159-3442 |
ISSN (Electronic) | 2159-3450 |
Conference
Conference | 2018 IEEE Region 10 Conference, TENCON 2018 |
---|---|
Country/Territory | Korea, Republic of |
City | Jeju |
Period | 18/10/28 → 18/10/31 |
Bibliographical note
Publisher Copyright:© 2018 IEEE.
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- Electrical and Electronic Engineering