Interpreting and Evaluating Neural Network Robustness

TOP 文献データベース Interpreting and Evaluating Neural Network Robustness

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1905.04270

PDF

https://arxiv.org/pdf/1905.04270

文献情報

作者: Fuxun Yu,Zhuwei Qin,Chenchen Liu,Liang Zhao,Yanzhi Wang,Xiang Chen
公開日: 2019-5-11
所属機関: George Mason University
所属の国: United States of America
会議名: International Joint Conference on Artificial Intelligence (IJCAI)

AIにより推定されたラベル

敵対的サンプルロバスト推定堅牢性検証手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Recently, adversarial deception becomes one of the most considerable threats to deep neural networks. However, compared to extensive research in new designs of various adversarial attacks and defenses, the neural networks' intrinsic robustness property is still lack of thorough investigation. This work aims to qualitatively interpret the adversarial attack and defense mechanism through loss visualization, and establish a quantitative metric to evaluate the neural network model's intrinsic robustness. The proposed robustness metric identifies the upper bound of a model's prediction divergence in the given domain and thus indicates whether the model can maintain a stable prediction. With extensive experiments, our metric demonstrates several advantages over conventional adversarial testing accuracy based robustness estimation: (1) it provides a uniformed evaluation to models with different structures and parameter scales; (2) it over-performs conventional accuracy based robustness estimation and provides a more reliable evaluation that is invariant to different test settings; (3) it can be fast generated without considerable testing cost.

外部データセット

MNIST

CIFAR10

ImageNet