This work addresses the adversarial robustness of deep networks by
means of novel learning arguments. Specifically, inspired by results in
neuroscience, we propose a local competition principle as a means of
achieving adversarially robust deep learning. We argue that novel local
winner-takes-all (LWTA) nonlinearities, combined with posterior sampling
schemes, can greatly improve the adversarial robustness of traditional deep
networks against strong adversarial attack schemes. We combine this LWTA mechanism with
tools from the field of Bayesian non-parametrics, specifically the
stick-breaking construction of the Indian Buffet Process, to flexibly account
for the inherent uncertainty in data-driven modeling. As we experimentally
show, the proposed model achieves high robustness to adversarial
perturbations on the MNIST and CIFAR10 datasets. It attains
state-of-the-art results under powerful white-box attacks, while at the same
time largely retaining its accuracy on benign data. Equally importantly, our
approach achieves this result while requiring far fewer trainable
parameters than the existing state of the art.
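To make the local competition principle concrete, below is a minimal sketch of a hard LWTA activation in PyTorch. The function name `lwta` and the group size are illustrative assumptions; the model described in the abstract samples winners stochastically via posterior sampling over the IBP stick-breaking construction, rather than taking a deterministic argmax as done here.

```python
import torch

def lwta(x: torch.Tensor, group_size: int = 2) -> torch.Tensor:
    """Hard local winner-takes-all (illustrative sketch).

    Units in the feature dimension are partitioned into groups of
    `group_size`; within each group, only the maximally activated
    unit passes through and the rest are zeroed out.
    """
    batch, features = x.shape
    assert features % group_size == 0, "feature dim must divide by group size"
    groups = x.view(batch, features // group_size, group_size)
    # Build a one-hot mask selecting the winner (max unit) in each group.
    mask = torch.zeros_like(groups).scatter_(
        -1, groups.argmax(dim=-1, keepdim=True), 1.0
    )
    return (groups * mask).view(batch, features)

# Example: a (2, 4) input with group_size=2 keeps one winner per pair.
out = lwta(torch.randn(2, 4), group_size=2)
```

In contrast to this deterministic version, sampling the winner from a data-driven posterior (as the abstract describes) injects stochasticity into the forward pass, which is part of what the paper argues hinders gradient-based adversarial attacks.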