Abstract
Recent work has shown deep neural networks (DNNs) to be highly susceptible to
well-designed, small perturbations at the input layer, or so-called adversarial
examples. In the case of images, such distortions are often imperceptible, yet
they can cause 100% misclassification by a state-of-the-art DNN. We study the
structure of adversarial examples and explore network
topology, pre-processing and training strategies to improve the robustness of
DNNs. We perform various experiments to assess the removability of adversarial
examples by corrupting inputs with additional noise and by pre-processing them with denoising
autoencoders (DAEs). We find that DAEs can remove substantial amounts of the
adversarial noise. However, when stacking the DAE with the original DNN, the
resulting network can again be attacked by new adversarial examples with even
smaller distortion. As a solution, we propose the Deep Contractive Network, a
model with a new end-to-end training procedure that includes a smoothness
penalty inspired by the contractive autoencoder (CAE). This increases the
network's robustness to adversarial examples without a significant performance penalty.
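To make the stacking experiment concrete, here is a minimal sketch of placing a trained DAE in front of a classifier as an input pre-processor. The names (classify, dae_reconstruct, stacked_classify) and the toy architectures are illustrative, not from the paper; the point is that the composition stays differentiable end to end, which is what lets an attacker craft new adversarial examples against the stacked model.

```python
import jax
import jax.numpy as jnp

def classify(net_params, x):
    # Toy two-layer classifier standing in for the original DNN.
    w1, b1, w2, b2 = net_params
    h = jnp.tanh(x @ w1 + b1)
    return h @ w2 + b2

def dae_reconstruct(dae_params, x):
    # Forward pass of a one-hidden-layer DAE, assumed to have been
    # trained offline to map noise-corrupted inputs back to clean ones.
    w_enc, b_enc, w_dec, b_dec = dae_params
    h = jax.nn.sigmoid(x @ w_enc + b_enc)
    return jax.nn.sigmoid(h @ w_dec + b_dec)

def stacked_classify(dae_params, net_params, x):
    # Denoise first, then classify. Gradients (and hence adversarial
    # perturbations) can be computed straight through the DAE.
    return classify(net_params, dae_reconstruct(dae_params, x))
```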
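Likewise, a minimal sketch of the kind of smoothness penalty the abstract describes, assuming the same toy classifier: a squared-Frobenius-norm penalty on the input-output Jacobian is added to the classification loss. A full implementation might instead apply such a penalty per layer; penalizing the end-to-end Jacobian, as here, is a simplification, and lam is an illustrative hyperparameter.

```python
import jax
import jax.numpy as jnp

def classify(params, x):
    # Same toy two-layer classifier as above.
    w1, b1, w2, b2 = params
    h = jnp.tanh(x @ w1 + b1)
    return h @ w2 + b2

def contractive_loss(params, x, y, lam=0.1):
    # Cross-entropy on a single example (y is a one-hot vector) ...
    logits = classify(params, x)
    ce = -jnp.sum(jax.nn.log_softmax(logits) * y)
    # ... plus a smoothness penalty: the squared Frobenius norm of the
    # input-output Jacobian, which shrinks the network's local
    # sensitivity to small input perturbations around x.
    jac = jax.jacrev(lambda xi: classify(params, xi))(x)
    return ce + lam * jnp.sum(jac ** 2)

grad_fn = jax.grad(contractive_loss)  # plug into any SGD-style optimizer
```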