Are Generative Classifiers More Robust to Adversarial Attacks?

TOP 文献データベース Are Generative Classifiers More Robust to Adversarial Attacks?

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1802.06552

PDF

https://arxiv.org/pdf/1802.06552

文献情報

作者: Yingzhen Li,John Bradshaw,Yash Sharma
公開日: 2018-2-19
更新日: 2019-5-27
所属機関: Microsoft Research Cambridge
所属の国: United Kingdom
会議名: International Conference on Machine Learning (ICML)

AIにより推定されたラベル

敵対的攻撃敵対的学習ロバスト性評価

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

There is a rising interest in studying the robustness of deep neural network classifiers against adversaries, with both advanced attack and defence techniques being actively developed. However, most recent work focuses on discriminative classifiers, which only model the conditional distribution of the labels given the inputs. In this paper, we propose and investigate the deep Bayes classifier, which improves classical naive Bayes with conditional deep generative models. We further develop detection methods for adversarial examples, which reject inputs with low likelihood under the generative model. Experimental results suggest that deep Bayes classifiers are more robust than deep discriminative classifiers, and that the proposed detection methods are effective against many recently proposed attacks.