Today's state-of-the-art image classifiers misclassify carefully
manipulated adversarial images. In this work, we develop a new,
localized adversarial attack that generates adversarial examples by
imperceptibly altering the backgrounds of normal images. We first use this
attack to highlight the undue sensitivity of neural networks to changes
in an image's background, and then use it as part of a new training technique:
localized adversarial training. By including localized adversarial images in the
training set, we train a classifier that incurs lower loss than a
non-adversarially trained counterpart on both natural and adversarial
inputs. Evaluating our localized adversarial training algorithm on the MNIST
and CIFAR-10 datasets shows a reduced loss of accuracy on natural images and
increased robustness against adversarial inputs.
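
To make the attack concrete, the sketch below shows one way a background-restricted perturbation could be computed, assuming a PyTorch classifier, images in [0, 1], and a per-image binary background mask. The function name localized_pgd, the PGD inner loop, and all hyperparameters are illustrative assumptions, not the paper's exact procedure.

```python
import torch
import torch.nn.functional as F

def localized_pgd(model, x, y, mask, eps=8/255, alpha=2/255, steps=10):
    """Craft an adversarial example by perturbing only background pixels.

    x:    images of shape (N, C, H, W) with values in [0, 1]
    y:    ground-truth labels of shape (N,)
    mask: binary mask of shape (N, 1, H, W), 1 where the pixel is background
    """
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        with torch.no_grad():
            # Ascend the loss, but write the update only into background pixels.
            x_adv = x_adv + alpha * grad.sign() * mask
            # Keep the change imperceptible: project back into an eps-ball
            # around the original image, then into the valid pixel range.
            x_adv = torch.clamp(x_adv, x - eps, x + eps)
            x_adv = torch.clamp(x_adv, 0.0, 1.0)
    return x_adv.detach()
```

In the corresponding training loop, such localized adversarial images would simply be mixed into (or substituted for) each minibatch before the usual gradient step.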