Abstract
To cope with the lack of on-device machine learning samples, this article presents a distributed data augmentation algorithm, coined federated data augmentation (FAug). In FAug, devices share a tiny fraction of their local data, i.e., seed samples, and collectively train a synthetic sample generator that can augment the local datasets of devices. To further improve FAug, we introduce a multihop-based seed sample collection method and an oversampling technique that mixes up collected seed samples. Both approaches enjoy the benefit from the crowd of devices, by hiding data privacy from preceding hops and feeding diverse seed samples. In the image classification tasks, simulations demonstrate that the proposed FAug frameworks yield stronger privacy guarantees, lower communication latency, and higher on-device ML accuracy.
| Original language | English |
|---|---|
| Pages (from-to) | 80-87 |
| Number of pages | 8 |
| Journal | IEEE Intelligent Systems |
| Volume | 36 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - 2021 |
Bibliographical note
Publisher Copyright:© 2001-2011 IEEE.
All Science Journal Classification (ASJC) codes
- Computer Networks and Communications
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Hiding in the Crowd: Federated Data Augmentation for On-Device Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver