Abstract
The advancements in AI/ML accelerators have made the core AI/ML computation relatively insignificant in application pipelines. For example, inferencing only accounts for 3% of the latency in an image-based ML pipeline with the help of Tensor Cores. The mismatch in performance growth between ML model computation and ML-adjacent computation, the producer and consumer of ML models, will become the bottleneck leading to system inefficiency. This paper presents a set of innovative algorithms to allow the entire ML-based computer vision pipelines to leverage AI/ML accelerators. Our proposed algorithms feature matrix-based operations that AI/ML accelerators specialize in. Simply compiler optimizations cannot take full advantage of hardware acceleration without revisiting algorithms. This paper implements the proposed algorithms as an open-source library, TensorCV, in a system platform with Tensor Cores. TensorCV shows a 6.12 × speedup in optimized ML-adjacent functions and saves 81 % energy consumption on modern heterogeneous computers. The code is available at https://github.com/escalab/TensorCV.
Original language | English |
---|---|
Title of host publication | 2023 IEEE/ACM International Symposium on Low Power Electronics and Design, ISLPED 2023 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9798350311754 |
DOIs | |
Publication status | Published - 2023 |
Event | 2023 IEEE/ACM International Symposium on Low Power Electronics and Design, ISLPED 2023 - Vienna, Austria Duration: 2023 Aug 7 → 2023 Aug 8 |
Publication series
Name | Proceedings of the International Symposium on Low Power Electronics and Design |
---|---|
Volume | 2023-August |
ISSN (Print) | 1533-4678 |
Conference
Conference | 2023 IEEE/ACM International Symposium on Low Power Electronics and Design, ISLPED 2023 |
---|---|
Country/Territory | Austria |
City | Vienna |
Period | 23/8/7 → 23/8/8 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
All Science Journal Classification (ASJC) codes
- General Engineering