Adversarial examples are inputs with small, deliberately crafted perturbations designed to fool artificial neural networks. In this work we study the usability of the Fisher information for the detection of such adversarial attacks. We discuss several quantities whose computation scales well with the network size, study their behavior on adversarial examples, and show how they can highlight the importance of individual input neurons, thereby providing a visual tool for further analyzing (un-)reasonable behavior of a neural network.
We demonstrate the potential of our methods through applications to the MNIST, CIFAR10, and Fruits-360 datasets.
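
For concreteness, one scalar quantity in this spirit is the trace of the Fisher information matrix of the network's predictive distribution with respect to its parameters, tr F(x) = Σ_y p(y|x) ||∇_θ log p(y|x)||². The following is only a minimal PyTorch sketch, not the paper's implementation; the name fisher_trace and the exact summation over classes (feasible for the ten classes of MNIST or CIFAR10) are our assumptions.

```python
import torch
import torch.nn.functional as F

def fisher_trace(model, x):
    """Trace of the parameter Fisher information at a single input x (no batch dim)."""
    params = [p for p in model.parameters() if p.requires_grad]
    log_probs = F.log_softmax(model(x.unsqueeze(0)), dim=1)  # shape (1, C)
    probs = log_probs.exp().detach().squeeze(0)
    trace = 0.0
    # tr F(x) = sum_y p(y|x) * ||grad_theta log p(y|x)||^2,
    # summed exactly over the (small) number of classes C.
    for y in range(log_probs.shape[1]):
        grads = torch.autograd.grad(log_probs[0, y], params, retain_graph=True)
        trace += probs[y].item() * sum(g.pow(2).sum().item() for g in grads)
    return trace
```

Atypical values of such a scalar on a given input could then serve as a flag for a possible adversarial example.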
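Similarly, the per-input-neuron importance mentioned above could be visualized from the diagonal of a Fisher information taken with respect to the input itself. Again this is an illustrative sketch under the same assumptions, with fisher_input_map a hypothetical name rather than the paper's method.

```python
import torch
import torch.nn.functional as F

def fisher_input_map(model, x):
    """Diagonal of the input-space Fisher information; same shape as x."""
    x = x.clone().requires_grad_(True)
    log_probs = F.log_softmax(model(x.unsqueeze(0)), dim=1)  # shape (1, C)
    probs = log_probs.exp().detach().squeeze(0)
    heat = torch.zeros_like(x)
    # diag F_x = sum_y p(y|x) * (grad_x log p(y|x))^2, elementwise over pixels
    for y in range(log_probs.shape[1]):
        (g,) = torch.autograd.grad(log_probs[0, y], x, retain_graph=True)
        heat += probs[y] * g.pow(2)
    return heat.detach()  # render as a heat map, e.g. with matplotlib's imshow
```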