GenLabel: Mixup Relabeling using Generative Models

Jy Yong Sohn, Liang Shang, Hongxu Chen, Jaekyun Moon, Dimitris Papailiopoulos, Kangwook Lee

Research output: Contribution to journalConference articlepeer-review

5 Citations (Scopus)

Abstract

Mixup is a data augmentation method that generates new data points by mixing a pair of input data. While mixup generally improves the prediction performance, it sometimes degrades the performance. In this paper, we first identify the main causes of this phenomenon by theoretically and empirically analyzing the mixup algorithm. To resolve this, we propose GenLabel, a simple yet effective relabeling algorithm designed for mixup. In particular, GenLabel helps the mixup algorithm correctly label mixup samples by learning the class-conditional data distribution using generative models. Via theoretical and empirical analysis, we show that mixup, when used together with GenLabel, can effectively resolve the aforementioned phenomenon, improving the accuracy of mixup-trained model.

Original languageEnglish
Pages (from-to)20278-20313
Number of pages36
JournalProceedings of Machine Learning Research
Volume162
Publication statusPublished - 2022
Event39th International Conference on Machine Learning, ICML 2022 - Baltimore, United States
Duration: 2022 Jul 172022 Jul 23

Bibliographical note

Publisher Copyright:
Copyright © 2022 by the author(s)

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'GenLabel: Mixup Relabeling using Generative Models'. Together they form a unique fingerprint.

Cite this