EAD: Elastic-Net Attacks to Deep Neural Networks via Adversarial Examples

TOP Literature Database EAD: Elastic-Net Attacks to Deep Neural Networks via Adversarial Examples

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1709.04114

PDF

https://arxiv.org/pdf/1709.04114

Paper Information

Author: Pin-Yu Chen,Yash Sharma,Huan Zhang,Jinfeng Yi,Cho-Jui Hsieh
Published: 9-13-2017
Updated: 2-10-2018
Affiliation: AI Foundations Lab, IBM T. J. Watson Research Center
Country: United States of America
Conference: AAAI Conference on Artificial Intelligence (AAAI)

Labels Estimated by AI

Certified Robustness Adversarial Learning Adversarial Example

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Recent studies have highlighted the vulnerability of deep neural networks (DNNs) to adversarial examples - a visually indistinguishable adversarial image can easily be crafted to cause a well-trained model to misclassify. Existing methods for crafting adversarial examples are based on $L_2$ and $L_\infty$ distortion metrics. However, despite the fact that $L_1$ distortion accounts for the total variation and encourages sparsity in the perturbation, little has been developed for crafting $L_1$-based adversarial examples. In this paper, we formulate the process of attacking DNNs via adversarial examples as an elastic-net regularized optimization problem. Our elastic-net attacks to DNNs (EAD) feature $L_1$-oriented adversarial examples and include the state-of-the-art $L_2$ attack as a special case. Experimental results on MNIST, CIFAR10 and ImageNet show that EAD can yield a distinct set of adversarial examples with small $L_1$ distortion and attains similar attack performance to the state-of-the-art methods in different attack scenarios. More importantly, EAD leads to improved attack transferability and complements adversarial training for DNNs, suggesting novel insights on leveraging $L_1$ distortion in adversarial machine learning and security implications of DNNs.

External Datasets

MNIST

CIFAR10

ImageNet