Stochastic Activation Pruning for Robust Adversarial Defense


International Conference on Learning Representations (ICLR)

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1803.01442

PDF

https://arxiv.org/pdf/1803.01442

Bibliographic Information

Authors: Guneet S. Dhillon, Kamyar Azizzadenesheli, Zachary C. Lipton, Jeremy Bernstein, Jean Kossaifi, Aran Khanna, Anima Anandkumar
Published: 2018-03-05
Affiliation: Amazon AI
Country of affiliation: United States of America
Conference: International Conference on Learning Representations (ICLR)

Labels Estimated by AI

Machine learning techniques, Adversarial learning, Detection of adversarial examples

※ These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".

Abstract

Neural networks are known to be vulnerable to adversarial examples. Carefully chosen perturbations to real images, while imperceptible to humans, induce misclassification and threaten the reliability of deep learning systems in the wild. To guard against adversarial examples, we take inspiration from game theory and cast the problem as a minimax zero-sum game between the adversary and the model. In general, for such games, the optimal strategy for both players requires a stochastic policy, also known as a mixed strategy. In this light, we propose Stochastic Activation Pruning (SAP), a mixed strategy for adversarial defense. SAP prunes a random subset of activations (preferentially pruning those with smaller magnitude) and scales up the survivors to compensate. We can apply SAP to pretrained networks, including adversarially trained models, without fine-tuning, providing robustness against adversarial examples. Experiments demonstrate that SAP confers robustness against attacks, increasing accuracy and preserving calibration.
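
The pruning step described in the abstract can be illustrated with a minimal NumPy sketch. The abstract only states that a random subset of activations is pruned, with smaller magnitudes pruned preferentially, and that survivors are scaled up. The concrete scheme below is an assumption made for illustration: indices are drawn with replacement with probability proportional to absolute activation, and each survivor is divided by its probability of being drawn at least once. The function name `stochastic_activation_pruning` and the parameter `num_samples` are hypothetical and are not taken from this page.

```python
import numpy as np


def stochastic_activation_pruning(h, num_samples, rng):
    """Stochastically prune a 1-D activation vector and rescale the survivors.

    Assumed scheme (not stated in the abstract): draw `num_samples` indices
    with replacement, with probability proportional to |h[j]|; every index
    that is never drawn is set to zero.
    """
    h = np.asarray(h, dtype=np.float64)
    magnitudes = np.abs(h)
    total = magnitudes.sum()
    if total == 0.0:
        return h.copy()  # all activations are zero, nothing to sample

    p = magnitudes / total  # larger magnitude -> more likely to be kept

    # Sample indices with replacement; duplicates are expected.
    drawn = rng.choice(h.size, size=num_samples, replace=True, p=p)
    keep = np.zeros(h.size, dtype=bool)
    keep[drawn] = True

    # Probability that index j is drawn at least once in num_samples draws.
    keep_prob = 1.0 - (1.0 - p) ** num_samples

    # Scale survivors by the inverse keep probability so that the expected
    # output equals the original activation.
    out = np.zeros_like(h)
    out[keep] = h[keep] / keep_prob[keep]
    return out


rng = np.random.default_rng(0)
h = np.array([0.1, -2.0, 0.0, 0.5, 3.0])
pruned = stochastic_activation_pruning(h, num_samples=3, rng=rng)
```

Under these assumptions the rescaling keeps each activation unbiased in expectation, which is one way to read "scales up the survivors to compensate". In a real network the same operation would be applied per layer at inference time. The choice of layers and of `num_samples` is left open here.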