A Closer Look at the Adversarial Robustness of Deep Equilibrium Models

TOP Literature Database A Closer Look at the Adversarial Robustness of Deep Equilibrium Models

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2306.01429

PDF

https://arxiv.org/pdf/2306.01429

Paper Information

Author: Zonghan Yang;Tianyu Pang;Yang Liu
Published: 6-2-2023
Affiliation: Department of Computer Science and Technology, Tsinghua University
Country: China
Conference: Computing Research Repository (CoRR)

Labels Estimated by AI

Adversarial attack Robustness Evaluation Adaptive Adversarial Training

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Deep equilibrium models (DEQs) refrain from the traditional layer-stacking paradigm and turn to find the fixed point of a single layer. DEQs have achieved promising performance on different applications with featured memory efficiency. At the same time, the adversarial vulnerability of DEQs raises concerns. Several works propose to certify robustness for monotone DEQs. However, limited efforts are devoted to studying empirical robustness for general DEQs. To this end, we observe that an adversarially trained DEQ requires more forward steps to arrive at the equilibrium state, or even violates its fixed-point structure. Besides, the forward and backward tracks of DEQs are misaligned due to the black-box solvers. These facts cause gradient obfuscation when applying the ready-made attacks to evaluate or adversarially train DEQs. Given this, we develop approaches to estimate the intermediate gradients of DEQs and integrate them into the attacking pipelines. Our approaches facilitate fully white-box evaluations and lead to effective adversarial defense for DEQs. Extensive experiments on CIFAR-10 validate the adversarial robustness of DEQs competitive with deep networks of similar sizes.

External Datasets

CIFAR-10