Memory-efficient DNN training on mobile devices

In Gim, Jeong Gil Ko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

On-device deep neural network (DNN) training holds the potential to enable a rich set of privacy-aware and infrastructure-independent personalized mobile applications. However, despite advancements in mobile hardware, locally training a complex DNN is still a nontrivial task given its resource demands. In this work, we show that the limited memory resources on mobile devices are the main constraint and propose Sage as a framework for efficiently optimizing memory resources for on-device DNN training. Specifically, Sage configures a flexible computation graph for DNN gradient evaluation and reduces the memory footprint of the graph using operator- and graph-level optimizations. In run-time, Sage employs a hybrid of gradient checkpointing and micro-batching techniques to dynamically adjust its memory use to the available system memory budget. Using implementation on off-the-shelf smartphones, we show that Sage enables local training of complex DNN models by reducing memory use by more than 20-fold compared to a baseline approach. We also show that Sage successfully adapts to run-time memory budget variations, and evaluate its energy consumption to show Sage's practical applicability.

Original languageEnglish
Title of host publicationMobiSys 2022 - Proceedings of the 2022 20th Annual International Conference on Mobile Systems, Applications and Services
PublisherAssociation for Computing Machinery, Inc
Pages464-476
Number of pages13
ISBN (Electronic)9781450391856
DOIs
Publication statusPublished - 2022 Jun 27
Event20th ACM International Conference on Mobile Systems, Applications and Services, MobiSys 2022 - Portland, United States
Duration: 2022 Jun 272022 Jul 1

Publication series

NameMobiSys 2022 - Proceedings of the 2022 20th Annual International Conference on Mobile Systems, Applications and Services

Conference

Conference20th ACM International Conference on Mobile Systems, Applications and Services, MobiSys 2022
Country/TerritoryUnited States
CityPortland
Period22/6/2722/7/1

Bibliographical note

Publisher Copyright:
© 2022 ACM.

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Memory-efficient DNN training on mobile devices'. Together they form a unique fingerprint.

Cite this