One Man's Trash is Another Man's Treasure: Resisting Adversarial Examples by Adversarial Examples

TOP Literature Database One Man's Trash is Another Man's Treasure: Resisting Adversarial Examples by Adversarial Examples

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1911.11219

PDF

https://arxiv.org/pdf/1911.11219

Paper Information

Author: Chang Xiao,Changxi Zheng
Published: 11-26-2019
Updated: 11-28-2019
Affiliation: Columbia University
Country: United States of America
Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Labels Estimated by AI

Effectiveness Analysis of Defense Methods Adversarial Example Adversarial Attack Methods

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Modern image classification systems are often built on deep neural networks, which suffer from adversarial examples--images with deliberately crafted, imperceptible noise to mislead the network's classification. To defend against adversarial examples, a plausible idea is to obfuscate the network's gradient with respect to the input image. This general idea has inspired a long line of defense methods. Yet, almost all of them have proven vulnerable. We revisit this seemingly flawed idea from a radically different perspective. We embrace the omnipresence of adversarial examples and the numerical procedure of crafting them, and turn this harmful attacking process into a useful defense mechanism. Our defense method is conceptually simple: before feeding an input image for classification, transform it by finding an adversarial example on a pre-trained external model. We evaluate our method against a wide range of possible attacks. On both CIFAR-10 and Tiny ImageNet datasets, our method is significantly more robust than state-of-the-art methods. Particularly, in comparison to adversarial training, our method offers lower training cost as well as stronger robustness.

External Datasets

CIFAR-10

Tiny ImageNet