Recent work has shown that state-of-the-art classifiers are surprisingly brittle: a
small adversarial perturbation of an input that was originally classified correctly
with high confidence leads to a wrong classification, again with high confidence.
This raises concerns that such classifiers are vulnerable to attacks and calls into
question their use in safety-critical systems. In this paper we provide, for the
first time, formal guarantees on the robustness of a classifier by giving
instance-specific lower bounds on the norm of the input manipulation required to
change the classifier's decision. Based on this analysis we propose the
Cross-Lipschitz regularization functional. We show that using this form of
regularization in both kernel methods and neural networks improves the robustness
of the classifier without any loss in prediction performance.
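To make the idea behind the regularizer concrete, the following is a minimal sketch of a cross-Lipschitz-style penalty, assuming it penalizes pairwise differences of class-score gradients at the training points; the symbols $f_l$ (class scores), $x_i$ (training points), $n$ (sample size), and $K$ (number of classes) are illustrative notation, and the exact functional used in the paper may differ.

% Sketch: at each training point x_i, penalize the squared norm of the
% difference of gradients of any two class scores f_l and f_m; this term
% controls how quickly the margin between classes l and m can change
% under a small perturbation of the input.
\[
\Omega_{\mathrm{CL}}(f) \;=\; \frac{1}{n K^{2}}
\sum_{i=1}^{n} \sum_{l,m=1}^{K}
\bigl\lVert \nabla f_l(x_i) - \nabla f_m(x_i) \bigr\rVert_2^{2}.
\]

Adding such a term to the training objective favors classifiers whose class scores vary similarly in all directions around the training data, which is one way to enlarge the lower bounds on the perturbation norm discussed above.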