Deep convolutional neural networks can be highly vulnerable to small
perturbations of their inputs, a major limitation on system robustness
when deep networks are used as classifiers. In this paper we propose a
low-cost method to explore samples lying near the decision boundaries of
a trained classifier, thereby identifying potential adversarial samples.
Identifying such samples in advance makes it possible to reduce the
search space of adversarial attack algorithms while maintaining a high
rate of successful perturbations. With our strategy, the candidate set
comprises only 61% of the CIFAR10 test data, yet it covers more than 82%
of the adversarial samples produced by iFGSM and 92% of the samples
successfully perturbed by DeepFool.
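
For readers unfamiliar with the attacks mentioned above, the following is a minimal sketch of iFGSM (iterative FGSM) in PyTorch. It is an illustration only, not the paper's evaluation setup: the model, the budget `eps`, the step size `alpha`, and the step count are hypothetical choices.

```python
# Minimal illustrative sketch of iFGSM, assuming a PyTorch classifier
# `model` with inputs in [0, 1]. Hyperparameters are assumptions, not
# the configuration used in the paper's experiments.
import torch
import torch.nn.functional as F

def ifgsm(model, x, y, eps=8 / 255, alpha=2 / 255, steps=10):
    """Iteratively step along the sign of the loss gradient, projecting
    the total perturbation back into an L-infinity ball of radius eps."""
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad, = torch.autograd.grad(loss, x_adv)
        with torch.no_grad():
            x_adv = x_adv + alpha * grad.sign()          # gradient-sign step
            x_adv = x + (x_adv - x).clamp(-eps, eps)     # project into eps-ball
            x_adv = x_adv.clamp(0, 1)                    # keep valid pixel range
    return x_adv.detach()
```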