Efficient and Transferable Adversarial Examples from Bayesian Neural Networks

TOP 文献データベース Efficient and Transferable Adversarial Examples from Bayesian Neural Networks

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2011.05074

PDF

https://arxiv.org/pdf/2011.05074

文献情報

作者: Martin Gubri;Maxime Cordy;Mike Papadakis;Yves Le Traon;Koushik Sen
公開日: 2020-11-10
更新日: 2022-6-19
所属機関: University of Luxembourg
所属の国: Luxembourg
会議名: Conference on Uncertainty in Artificial Intelligence (UAI)

AIにより推定されたラベル

敵対的サンプル敵対的攻撃モデル性能評価

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

An established way to improve the transferability of black-box evasion attacks is to craft the adversarial examples on an ensemble-based surrogate to increase diversity. We argue that transferability is fundamentally related to uncertainty. Based on a state-of-the-art Bayesian Deep Learning technique, we propose a new method to efficiently build a surrogate by sampling approximately from the posterior distribution of neural network weights, which represents the belief about the value of each parameter. Our extensive experiments on ImageNet, CIFAR-10 and MNIST show that our approach improves the success rates of four state-of-the-art attacks significantly (up to 83.2 percentage points), in both intra-architecture and inter-architecture transferability. On ImageNet, our approach can reach 94% of success rate while reducing training computations from 11.6 to 2.4 exaflops, compared to an ensemble of independently trained DNNs. Our vanilla surrogate achieves 87.5% of the time higher transferability than three test-time techniques designed for this purpose. Our work demonstrates that the way to train a surrogate has been overlooked, although it is an important element of transfer-based attacks. We are, therefore, the first to review the effectiveness of several training methods in increasing transferability. We provide new directions to better understand the transferability phenomenon and offer a simple but strong baseline for future work.