AIセキュリティポータル K Program
How many dimensions are required to find an adversarial example?
Share
Abstract
Past work exploring adversarial vulnerability have focused on situations where an adversary can perturb all dimensions of model input. On the other hand, a range of recent works consider the case where either (i) an adversary can perturb a limited number of input parameters or (ii) a subset of modalities in a multimodal problem. In both of these cases, adversarial examples are effectively constrained to a subspace $V$ in the ambient input space $\mathcal{X}$. Motivated by this, in this work we investigate how adversarial vulnerability depends on $\dim(V)$. In particular, we show that the adversarial success of standard PGD attacks with $\ell^p$ norm constraints behaves like a monotonically increasing function of $\epsilon (\frac{\dim(V)}{\dim \mathcal{X}})^{\frac{1}{q}}$ where $\epsilon$ is the perturbation budget and $\frac{1}{p} + \frac{1}{q} =1$, provided $p > 1$ (the case $p=1$ presents additional subtleties which we analyze in some detail). This functional form can be easily derived from a simple toy linear model, and as such our results land further credence to arguments that adversarial examples are endemic to locally linear models on high dimensional spaces.
Optimization with Sparsity-Inducing Penalties
Francis Bach, Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski
Published: 2011
A Note on Quantiles in Large Samples
R. R. Bahadur
Published: 1966
Adversarial examples in multi-layer random relu networks
P. Bartlett, S. Bubeck, Y. Cherapanamjeri
Published: 2021
A single gradient step finds adversarial examples on random two-layers neural networks
S. Bubeck, Y. Cherapanamjeri, G. Gidel, R. Tachet des Combes
Published: 2021
Curse of dimensionality in adversarial examples
Nandish Chattopadhyay, Anupam Chattopadhyay, Sourav Sen Gupta, Michael Kasper
Published: 2019
Intriguing Properties of Adversarial Examples
Ekin D. Cubuk, Barret Zoph, Samuel S. Schoenholz, Quoc V. Le
Published: 11.8.2017
Imagenet: A large-scale hierarchical image database
J. Deng, W. Dong, R. Socher, L. Li, K. Li, L. Fei-Fei
Published: 2009
Origins of low-dimensional adversarial perturbations
Elvis Dohmatob, Chuan Guo, Morgane Goibert
Published: 2022
Analysis of classifiers’ robustness to adversarial perturbations
A. Fawzi, O. Fawzi, P. Frossard
Published: 2018
Robustness of classifiers: from adversarial to random noise
Alhussein Fawzi, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard
Published: 2016
Robustness of classifiers to uniform `p and gaussian noise
Jean-Yves Franceschi, Alhussein Fawzi, Omar Fawzi
Published: 2018
Motivating the Rules of the Game for Adversarial Example Research
Justin Gilmer, Ryan P. Adams, Ian Goodfellow, David Andersen, George E. Dahl
Published: 7.18.2018
A Discussion of ’Adversarial Examples Are Not Bugs, They Are Features’
Justin Gilmer, Dan Hendrycks
Published: 2019
Explaining and harnessing adversarial examples
Ian Goodfellow, Jonathon Shlens, Christian Szegedy
Published: 2015
Low Frequency Adversarial Perturbation
Chuan Guo, Jared S. Frank, Kilian Q. Weinberger
Deep residual learning for image recognition
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Published: 2016
On the geometry of adversarial examples
Marc Khoury, Dylan Hadfield-Menell
Published: 2018
Learning multiple layers of features from tiny images
Alex Krizhevsky, Geoffrey Hinton
Published: 2009
Black box attacks on deep anomaly detectors
Aditya Kuppa, Slawomir Grzonkowski, Muhammad Rizwan Asghar, Nhien-An Le-Khac
Published: 2019
Functional adversarial attacks
Cassidy Laidlaw, Soheil Feizi
Published: 2019
Towards Deep Learning Models Resistant to Adversarial Attacks
Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu
Published: 6.20.2017
Torchvision the machine-vision package of torch
Sébastien Marcel, Yann Rodriguez
Published: 2010
Deepfool: a simple and accurate method to fool deep neural networks
S.-M. Moosavi-Dezfooli, A. Fawzi, P. Frossard
Published: 2016
On the spectral bias of neural networks
Nasim Rahaman, Aristide Baratin, Devansh Arpit, Felix Draxler, Min Lin, Fred Hamprecht, Yoshua Bengio, Aaron Courville
Published: 2019
Are adversarial examples inevitable?
Ali Shafahi, W. Ronny Huang, Christoph Studer, Soheil Feizi, Tom Goldstein
Published: 2019
On the effectiveness of low frequency perturbations
Y. Sharma, G. W. Ding, M. A. Brubaker
Published: 2019
First-order adversarial vulnerability of neural networks and input dimension
Carl-Johann Simon-Gabriel, Yann Ollivier, Leon Bottou, Bernhard Schölkopf, David Lopez-Paz
Published: 2019
Disentangling adversarial robustness and generalization
David Stutz, Matthias Hein, Bernt Schiele
Published: 2019
Intriguing properties of neural networks
C. Szegedy, W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, R. Fergus
Published: 2014
SciPy 1.0: Fundamental algorithms for scientific computing in python
P. Virtanen, R. Gommers, T. E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, P. Peterson, W. Weckesser, J. Bright, S. J. van der Walt, M. Brett, J. Wilson, K. J. Millman, N. Mayorov, A. R. J. Nelson, E. Jones, R. Kern, E. Larson, C. J. Carey, I. Polat, Y. Feng, E. W. Moore, J. VanderPlas, D. Laxalde, J. Perk t old, R. Cimrman, I. Henriksen, E. A. Quintero, C. R. Harris, A. M. Archibald, A. H. Ribeiro, F. Pedregosa, P. van Mulbregt, SciPy 1.0 Contributors
Published: 2020
Adversarial examples for semantic segmentation and object detection
Cihang Xie, Jianyu Wang, Zhishuai Zhang, Yuyin Zhou, Lingxi Xie, Alan Yuille
Published: 2017
A fourier perspective on model robustness in computer vision
D. Yin, R. Gontijo Lopes, J. Shlens, E. D. Cubuk, J. Gilmer
Published: 2019
Adversarial color enhancement: Generating unrestricted adversarial images by optimizing a color filter
Zhengyu Zhao, Zhuoran Liu, Martha A. Larson
Published: 2020
Share