Modern image classification systems are often built on deep neural networks,
which suffer from adversarial examples--images carrying deliberately crafted,
imperceptible noise that misleads the network's classification. To defend against
adversarial examples, a plausible idea is to obfuscate the network's gradient
with respect to the input image. This general idea has inspired a long line of
defense methods. Yet, almost all of them have proven vulnerable. We revisit
this seemingly flawed idea from a radically different perspective. We embrace
the omnipresence of adversarial examples and the numerical procedure of
crafting them, and turn this harmful attacking process into a useful defense
mechanism. Our defense method is conceptually simple: before feeding an input
image to the classifier, transform it by crafting an adversarial example against a
pre-trained external model. We evaluate our method against a wide range of
possible attacks. On both the CIFAR-10 and Tiny ImageNet datasets, our method is
significantly more robust than state-of-the-art defenses. In particular, compared
with adversarial training, our method offers lower training cost as well as
stronger robustness.
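
To make the transformation concrete, the following is a minimal, illustrative
PyTorch sketch of the idea: an untargeted PGD-style attack run against a
pre-trained external model, whose output is then classified in place of the raw
input. The names pgd_transform, external_model, and classifier, along with the
steps, eps, and alpha values, are assumptions for illustration; the paper's
actual attack procedure, loss, and hyperparameters may differ.

    import torch
    import torch.nn.functional as F

    def pgd_transform(x, external_model, steps=10, eps=8/255, alpha=2/255):
        # Craft an adversarial example against the external (defense-side)
        # model and return it; the perturbed image, not the original, is
        # what the downstream classifier sees.
        # x: batch of images in [0, 1]; external_model: frozen pre-trained net.
        x_adv = x.clone().detach()
        with torch.no_grad():
            # Pseudo-labels from the external model stand in for ground truth.
            y = external_model(x_adv).argmax(dim=1)
        for _ in range(steps):
            x_adv.requires_grad_(True)
            loss = F.cross_entropy(external_model(x_adv), y)
            grad, = torch.autograd.grad(loss, x_adv)
            with torch.no_grad():
                # Ascend the external model's loss, then project back into
                # the L-infinity eps-ball around the original input and the
                # valid pixel range.
                x_adv = x_adv + alpha * grad.sign()
                x_adv = x + (x_adv - x).clamp(-eps, eps)
                x_adv = x_adv.clamp(0.0, 1.0)
        return x_adv.detach()

    # Usage: classify the transformed image instead of the raw input.
    # logits = classifier(pgd_transform(images, external_model))

Note that the external model here serves only as the target of the crafted
perturbation; it is separate from the classifier whose robustness is being
defended.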