Temporal shuffling for defending deep action recognition models against adversarial attacks

Jaehui Hwang, Huan Zhang, Jun Ho Choi, Cho Jui Hsieh, Jong Seok Lee

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Recently, video-based action recognition methods using convolutional neural networks (CNNs) achieve remarkable recognition performance. However, there is still lack of understanding about the generalization mechanism of action recognition models. In this paper, we suggest that action recognition models rely on the motion information less than expected, and thus they are robust to randomization of frame orders. Furthermore, we find that motion monotonicity remaining after randomization also contributes to such robustness. Based on this observation, we develop a novel defense method using temporal shuffling of input videos against adversarial attacks for action recognition models. Another observation enabling our defense method is that adversarial perturbations on videos are sensitive to temporal destruction. To the best of our knowledge, this is the first attempt to design a defense method without additional training for 3D CNN-based video action recognition models.

Original languageEnglish
Pages (from-to)388-397
Number of pages10
JournalNeural Networks
Volume169
DOIs
Publication statusPublished - 2024 Jan

Bibliographical note

Publisher Copyright:
© 2023 Elsevier Ltd

All Science Journal Classification (ASJC) codes

  • Cognitive Neuroscience
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Temporal shuffling for defending deep action recognition models against adversarial attacks'. Together they form a unique fingerprint.

Cite this