AI Accelerators for Cloud and Server Applications

Rakesh Shrestha, Rojeena Bajracharya, Ashutosh Mishra, Shiho Kim

Research output: Chapter in Book/Report/Conference proceedingChapter

2 Citations (Scopus)

Abstract

AI accelerator is a specialized hardware processing unit that provides high throughput, lower latency, and higher energy efficiency compared to existing server-based processors available in the market. Some AI accelerators are NPU, GPU, FPGA, and ASIC. As compared to other accelerators, ASICs are much more efficient technology as they consume very low power and can be readily customized for specific activities. The AI accelerators can be used in cloud servers as well as at the edge devices. Nowadays, the cloud provides an ideal environment for Machine Learning as it gathers a massive amount of data from various sources. At the same time, edge computing or in-device computing is the ideal option for inference that requires quick output. AI accelerator architecture is necessary for advanced data centers to address the ever-increasing demands of processing and handling massive datasets workloads such as machine vision, deep learning, AI, etc. Moreover, it is necessary to consider the servers’ power consumed and the data center’s power budget while designing the AI accelerators. This chapter discusses various AI accelerators in the cloud, data centers, servers, and edge computing.

Original languageEnglish
Title of host publicationArtificial Intelligence and Hardware Accelerators
PublisherSpringer International Publishing
Pages95-125
Number of pages31
ISBN (Electronic)9783031221705
ISBN (Print)9783031221699
DOIs
Publication statusPublished - 2023 Jan 1

Bibliographical note

Publisher Copyright:
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.

All Science Journal Classification (ASJC) codes

  • General Engineering
  • General Computer Science

Fingerprint

Dive into the research topics of 'AI Accelerators for Cloud and Server Applications'. Together they form a unique fingerprint.

Cite this