The robustness of neural networks to adversarial examples has received great
attention due to its security implications. Despite the variety of attack
approaches for crafting visually imperceptible adversarial examples, little
work has been done toward a comprehensive measure of robustness. In this
paper, we provide a
theoretical justification for converting robustness analysis into a local
Lipschitz constant estimation problem, and propose to use Extreme Value
Theory for efficient evaluation. Our analysis yields a novel robustness metric
called CLEVER, which is short for Cross Lipschitz Extreme Value for nEtwork
Robustness. The proposed CLEVER score is attack-agnostic and computationally
feasible for large neural networks. Experimental results on various networks,
including ResNet, Inception-v3 and MobileNet, show that (i) CLEVER is aligned
with the robustness indicated by the $\ell_2$ and $\ell_\infty$ norms of
adversarial perturbations found by powerful attacks, and (ii) networks
defended with defensive distillation or bounded ReLU indeed attain higher
CLEVER scores. To
the best of our knowledge, CLEVER is the first attack-independent robustness
metric that can be applied to any neural network classifier.
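As a rough illustration of the estimation procedure described above, the sketch below samples gradient norms of the classification margin in a ball around an input, fits a reverse Weibull distribution to the per-batch maxima (the Extreme Value Theory step), and divides the margin by the resulting local Lipschitz estimate. The toy two-layer model, the sampling radius, and all parameter values are hypothetical placeholders rather than the paper's actual experimental setup; only numpy and scipy are assumed.

```python
# Minimal sketch of a CLEVER-style score, assuming a toy model (not the
# paper's reference implementation). Requires numpy and scipy.
import numpy as np
from scipy.stats import weibull_max

rng = np.random.default_rng(0)

# Hypothetical toy two-class classifier f(x) = V @ tanh(W @ x); a real
# evaluation would replace g and grad_g with a neural network's margin
# function and its backpropagated gradient.
d, h = 10, 16
W = rng.normal(size=(h, d))
V = rng.normal(size=(2, h))
x0 = rng.normal(size=d)
logits0 = V @ np.tanh(W @ x0)
c, j = int(np.argmax(logits0)), int(np.argmin(logits0))  # predicted vs. target

def g(x):
    """Classification margin g(x) = f_c(x) - f_j(x)."""
    logits = V @ np.tanh(W @ x)
    return logits[c] - logits[j]

def grad_g(x):
    """Analytic gradient of the margin for the toy model."""
    a = np.tanh(W @ x)
    return W.T @ ((V[c] - V[j]) * (1.0 - a ** 2))

def sample_in_l2_ball(center, radius, n):
    """Draw n points uniformly from the l2 ball around center."""
    u = rng.normal(size=(n, center.size))
    u /= np.linalg.norm(u, axis=1, keepdims=True)
    r = radius * rng.random(n) ** (1.0 / center.size)
    return center + r[:, None] * u

def clever_l2(radius=0.5, n_batches=50, batch_size=100):
    # 1) Record the maximum gradient norm within each batch of samples.
    maxima = []
    for _ in range(n_batches):
        pts = sample_in_l2_ball(x0, radius, batch_size)
        maxima.append(max(np.linalg.norm(grad_g(p)) for p in pts))
    # 2) Fit a reverse Weibull distribution to the batch maxima; its
    #    location parameter estimates the local cross-Lipschitz constant.
    _, loc, _ = weibull_max.fit(np.asarray(maxima))
    # 3) The score lower-bounds the l2 distortion needed to flip the
    #    prediction from class c to class j, capped at the sampling radius.
    return min(g(x0) / loc, radius)

print("CLEVER l2 score (toy model):", clever_l2())
```

Here the location parameter of the fitted reverse Weibull plays the role of the local cross-Lipschitz constant, and a larger score suggests a larger region around the input in which no perturbation can flip the prediction; because the estimate uses only random samples and gradients, it requires no attack algorithm, which is what makes the metric attack-agnostic.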