AIセキュリティポータル K Program
Rethinking PGD Attack: Is Sign Function Necessary?
Share
Abstract
Neural networks have demonstrated success in various domains, yet their performance can be significantly degraded by even a small input perturbation. Consequently, the construction of such perturbations, known as adversarial attacks, has gained significant attention, many of which fall within "white-box" scenarios where we have full access to the neural network. Existing attack algorithms, such as the projected gradient descent (PGD), commonly take the sign function on the raw gradient before updating adversarial inputs, thereby neglecting gradient magnitude information. In this paper, we present a theoretical analysis of how such sign-based update algorithm influences step-wise attack performance, as well as its caveat. We also interpret why previous attempts of directly using raw gradients failed. Based on that, we further propose a new raw gradient descent (RGD) algorithm that eliminates the use of sign. Specifically, we convert the constrained optimization problem into an unconstrained one, by introducing a new hidden variable of non-clipped perturbation that can move beyond the constraint. The effectiveness of the proposed RGD algorithm has been demonstrated extensively in experiments, outperforming PGD and other competitors in various settings, without incurring any additional computational overhead. The codes is available in https://github.com/JunjieYang97/RGD.
Efficient and effective augmentation strategy for adversarial training
Sravanti Addepalli, Samyak Jain
Published: 2022
The role of’sign’and’direction’of gradient on the performance of cnn
A. Agarwal, R. Singh, M. Vatsa
Published: 2020
Sign bits are all you need for black-box attacks
Abdullah Al-Dujaili, Una-May O’Reilly
Published: 2019
Understanding and improving fast adversarial training
M. Andriushchenko, N. Flammarion
Published: 2020
Fine-grained analysis of optimization and generalization for overparameterized two-layer neural networks
S. Arora, S. Du, W. Hu, Z. Li, R. Wang
Published: 2019
Towards Evaluating the Robustness of Neural Networks
Nicholas Carlini, David Wagner
Published: 2016.8.17
RayS: A Ray Searching Method for Hard-label Adversarial Attack
Jinghui Chen, Quanquan Gu
Published: 2020.6.23
ZOO: Zeroth Order Optimization based Black-box Attacks to Deep Neural Networks without Training Substitute Models
Pin-Yu Chen, Huan Zhang, Yash Sharma, Jinfeng Yi, Cho-Jui Hsieh
Published: 2017.8.14
RobustBench: a standardized adversarial robustness benchmark
F. Croce, M. Andriushchenko, V. Sehwag, E. Debenedetti, N. Flammarion, M. Chiang, P. Mittal, M. Hein
Published: 2021
Mma training: Direct input space margin maximization through adversarial training
G. W. Ding, Y. Sharma, K. Y. C. Lui, R. Huang
Published: 2020
Boosting Adversarial Attacks with Momentum
Yinpeng Dong, Fangzhou Liao, Tianyu Pang, Hang Su, Jun Zhu, Xiaolin Hu, Jianguo Li
Published: 2017.10.17
Gradient descent provably optimizes over-parameterized neural networks
S. S. Du, X. Zhai, B. Poczos, A. Singh
Published: 2019
Explaining and harnessing adversarial examples
Goodfellow, I. J., Shlens, J., Szegedy, C.
Published: 2015
Deep residual learning for image recognition
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Published: 2016
Long short-term memory
S. Hochreiter, J. Schmidhuber
Published: 1997
Densely connected convolutional networks
G. Huang, Z. Liu, L. Van Der Maaten, K. Q. Weinberger
Published: 2017
Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks
Hanxun Huang, Yisen Wang, Sarah Monazam Erfani, Quanquan Gu, James Bailey, Xingjun Ma
Published: 2021.10.8
Las-at: adversarial training with learnable attack strategy
X. Jia, Y. Zhang, B. Wu, K. Ma, J. Wang, X. Cao
Published: 2022
Imagenet classification with deep convolutional neural networks
Alex Krizhevsky, Ilya Sutskever, Geoffrey E Hinton
Published: 2012
Adversarial examples in the physical world
Alexey Kurakin, Ian Goodfellow, Samy Bengio
Published: 2016.7.9
signsgd via zeroth-order oracle
Liu, S., Chen, P., Chen, X., Hong, M.
Published: 2019
Towards Deep Learning Models Resistant to Adversarial Attacks
Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu
Published: 2017.6.20
Deepfool: a simple and accurate method to fool deep neural networks
S.-M. Moosavi-Dezfooli, A. Fawzi, P. Frossard
Published: 2016
Rectified linear units improve restricted boltzmann machines
V. Nair, G. E. Hinton
Published: 2010
Overfitting in adversarially robust deep learning
L. Rice, E. Wong, Z. Kolter
Published: 2020
Do adversarially robust ImageNet models transfer better?
H. Salman, A. Ilyas, L. Engstrom, A. Kapoor, A. Madry
Published: 2020
Very deep convolutional networks for large-scale image recognition
K. Simonyan, A. Zisserman
Published: 2015
Intriguing properties of neural networks
C. Szegedy, W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, R. Fergus
Published: 2014
Going deeper with convolutions
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. E. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich
Published: 2015
Better diffusion models further improve adversarial training
Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng Yan
Published: 2023
Fast is better than free: Revisiting adversarial training
Eric Wong, Leslie Rice, J. Zico Kolter
Published: 2020.1.13
Adversarial weight perturbation helps robust generalization
D. Wu, S. tao Xia, Y. Wang
Published: 2020
Geometry-aware instance-reweighted adversarial training
J. Zhang, J. Zhu, G. Niu, B. Han, M. Sugiyama, M. Kankanhalli
Published: 2021
Adversarial robustness through the lens of causality
Yonggang Zhang, Mingming Gong, Tongliang Liu, Gang Niu, Xinmei Tian, Bo Han, Bernhard Scholkopf, Kun Zhang
Published: 2022
Share