Accelerating forwarding computation of artificial neural network using CUDA

Jong Hyun Park, Won Woo Ro

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Recently, graphics processing units (GPUs) are widely used for accelerating general purpose workloads using programming models such as open computing language (OpenCL) or compute unified device architecture (CUDA). In this paper, we accelerated the Artificial Neural Network (ANN) algorithm, one of the popular algorithm in machine learning and cognitive science, since the ANN algorithm needs to be faster for solving more complex problem or operating in real-time. The ANN algorithm has great potential for GPU acceleration since it is constructed with large data-parallel computations. We implemented forwarding computation of ANN in CUDA and optimized it using scratchpad memory of GPUs and leveraging the thread block size. As a results, our method shows 2.32 times faster performance compared to conventional CPU.

Original languageEnglish
Title of host publicationInternational Conference on Electronics, Information, and Communications, ICEIC 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781467380164
DOIs
Publication statusPublished - 2016 Sept 7
Event15th International Conference on Electronics, Information, and Communications, ICEIC 2016 - Danang, Viet Nam
Duration: 2016 Jan 272016 Jan 30

Publication series

NameInternational Conference on Electronics, Information, and Communications, ICEIC 2016

Other

Other15th International Conference on Electronics, Information, and Communications, ICEIC 2016
Country/TerritoryViet Nam
CityDanang
Period16/1/2716/1/30

Bibliographical note

Publisher Copyright:
© 2016 IEEE.

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering
  • Control and Systems Engineering

Fingerprint

Dive into the research topics of 'Accelerating forwarding computation of artificial neural network using CUDA'. Together they form a unique fingerprint.

Cite this