Rethinking PGD Attack: Is Sign Function Necessary?

TOP 文献データベース Rethinking PGD Attack: Is Sign Function Necessary?

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2312.01260

PDF

https://arxiv.org/pdf/2312.01260

文献情報

作者: Junjie Yang;Tianlong Chen;Xuxi Chen;Zhangyang Wang;Yingbin Liang
公開日: 2023-12-3
更新日: 2024-5-21
所属機関: The Ohio State University
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

ロバスト性評価ポイズニング敵対的攻撃

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Neural networks have demonstrated success in various domains, yet their performance can be significantly degraded by even a small input perturbation. Consequently, the construction of such perturbations, known as adversarial attacks, has gained significant attention, many of which fall within "white-box" scenarios where we have full access to the neural network. Existing attack algorithms, such as the projected gradient descent (PGD), commonly take the sign function on the raw gradient before updating adversarial inputs, thereby neglecting gradient magnitude information. In this paper, we present a theoretical analysis of how such sign-based update algorithm influences step-wise attack performance, as well as its caveat. We also interpret why previous attempts of directly using raw gradients failed. Based on that, we further propose a new raw gradient descent (RGD) algorithm that eliminates the use of sign. Specifically, we convert the constrained optimization problem into an unconstrained one, by introducing a new hidden variable of non-clipped perturbation that can move beyond the constraint. The effectiveness of the proposed RGD algorithm has been demonstrated extensively in experiments, outperforming PGD and other competitors in various settings, without incurring any additional computational overhead. The codes is available in https://github.com/JunjieYang97/RGD.

外部データセット

CIFAR-10

CIFAR-100

ImageNet