Resisting Adversarial Attacks using Gaussian Mixture Variational Autoencoders

TOP 文献データベース Resisting Adversarial Attacks using Gaussian Mixture Variational Autoencoders

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1806.00081

PDF

https://arxiv.org/pdf/1806.00081

文献情報

作者: Partha Ghosh,Arpan Losalka,Michael J Black
公開日: 2018-6-1
更新日: 2018-12-11
所属機関: Max Planck Institute of Intelligent Systems
所属の国: Germany
会議名: AAAI Conference on Artificial Intelligence (AAAI)

AIにより推定されたラベル

損失関数モデルの頑健性保証敵対的サンプル

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Susceptibility of deep neural networks to adversarial attacks poses a major theoretical and practical challenge. All efforts to harden classifiers against such attacks have seen limited success. Two distinct categories of samples to which deep networks are vulnerable, "adversarial samples" and "fooling samples", have been tackled separately so far due to the difficulty posed when considered together. In this work, we show how one can address them both under one unified framework. We tie a discriminative model with a generative model, rendering the adversarial objective to entail a conflict. Our model has the form of a variational autoencoder, with a Gaussian mixture prior on the latent vector. Each mixture component of the prior distribution corresponds to one of the classes in the data. This enables us to perform selective classification, leading to the rejection of adversarial samples instead of misclassification. Our method inherently provides a way of learning a selective classifier in a semi-supervised scenario as well, which can resist adversarial attacks. We also show how one can reclassify the rejected adversarial samples.

外部データセット

MNIST

SVHN

COIL-100