Automatically Discovering Novel Visual Categories with Adaptive Prototype Learning

Lu Zhang, Lu Qi, Xu Yang, Hong Qiao, Ming Hsuan Yang, Zhiyong Liu

Research output: Contribution to journal › Article › peer-review

Abstract

This article addresses novel category discovery (NCD), which aims to discover unknown categories given a set of already known classes. NCD is challenging because it closely mirrors real-world scenarios, in which only a subset of classes and their corresponding images have been encountered. Unlike previous approaches to NCD, we propose an adaptive prototype learning method that leverages prototypes to emphasize category discrimination and alleviate the issue of missing annotations for novel classes. Concretely, the proposed method consists of two main stages: prototypical representation learning and prototypical self-training. In the first stage, we develop a robust feature extractor that can effectively handle images from both base and novel categories. The feature extractor's capacity for instance and category discrimination is boosted by self-supervised learning and adaptive prototypes. In the second stage, we use the prototypes again to rectify offline pseudo labels and train a final parametric classifier for category clustering. Extensive experiments on four benchmark datasets demonstrate the method's effectiveness and robustness, with state-of-the-art performance.
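
The abstract does not specify how prototypes are computed or how pseudo labels are rectified, so the following is only a minimal sketch of the general idea under stated assumptions: each prototype is taken to be the mean of the L2-normalized features currently assigned to a class, and rectification reassigns every sample to its most similar prototype by cosine similarity. The function names (`compute_prototypes`, `rectify_pseudo_labels`) and the `num_iters` parameter are hypothetical and are not taken from the paper.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length (all-zero rows stay zero)."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, eps)


def compute_prototypes(features, labels, num_classes):
    """Assumed prototype definition: mean of normalized features per class."""
    feats = l2_normalize(features)
    protos = np.zeros((num_classes, feats.shape[1]))
    for k in range(num_classes):
        members = feats[labels == k]
        if len(members) > 0:  # an empty class keeps a zero prototype
            protos[k] = members.mean(axis=0)
    return l2_normalize(protos)


def rectify_pseudo_labels(features, pseudo_labels, num_classes, num_iters=5):
    """Reassign each sample to its nearest prototype (cosine similarity).

    Illustrative stand-in for prototype-based label rectification;
    it is not the authors' procedure.
    """
    feats = l2_normalize(features)
    labels = np.asarray(pseudo_labels).copy()
    for _ in range(num_iters):
        protos = compute_prototypes(features, labels, num_classes)
        new_labels = (feats @ protos.T).argmax(axis=1)
        if np.array_equal(new_labels, labels):
            break  # assignments are stable
        labels = new_labels
    return labels
```

In this sketch, a sample whose initial pseudo label disagrees with the bulk of its neighbours in feature space is pulled back to the class whose prototype it most resembles. In a full pipeline the rectified labels would then supervise a parametric classifier.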

Original language: English
Pages (from-to): 2533-2544
Number of pages: 12
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume: 46
Issue number: 4
Publication status: Published - 2024 Apr 1

Bibliographical note

Publisher Copyright:
© 1979-2012 IEEE.

All Science Journal Classification (ASJC) codes

  • Software
  • Computer Vision and Pattern Recognition
  • Computational Theory and Mathematics
  • Artificial Intelligence
  • Applied Mathematics
