Deep neural networks are vulnerable to adversarial examples. Prior defenses
attempted to make deep networks more robust by either changing the network
architecture or augmenting the training set with adversarial examples, but both
approaches have inherent limitations. Motivated by recent research showing that
outliers in the training set have a strong negative influence on the trained
model, we study the relationship between model robustness and the quality of the
training set. We first show that outliers give the model better generalization
ability but weaker robustness. Next, we propose an adversarial example detection
framework in which we design two methods for removing outliers from the training
set to obtain a sanitized model, and then detect adversarial examples by
measuring the difference between the outputs of the original and sanitized
models. We evaluated the framework on both MNIST and SVHN. Using the
Kullback-Leibler divergence to measure this difference, we detect adversarial
examples with accuracy ranging from 94.67% to 99.89%.
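As a concrete sketch of the detection criterion (the notation here is ours, not
taken from the abstract): let $f(x)$ and $f_s(x)$ denote the softmax output
vectors of the original and sanitized models on an input $x$. The difference is
measured by the Kullback-Leibler divergence

$$D_{\mathrm{KL}}\!\left(f(x) \,\|\, f_s(x)\right) = \sum_{i} f(x)_i \log \frac{f(x)_i}{f_s(x)_i},$$

the intuition being that this divergence is larger for adversarial inputs than
for benign ones; presumably an input is flagged as adversarial when the
divergence exceeds some threshold, though the exact decision rule is an
assumption on our part.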