Featurized Bidirectional GAN: Adversarial Defense via Adversarially Learned Semantic Inference

TOP 文献データベース Featurized Bidirectional GAN: Adversarial Defense via Adversarially Learned Semantic Inference

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1805.07862

PDF

https://arxiv.org/pdf/1805.07862

文献情報

作者: Ruying Bao,Sihang Liang,Qingcan Wang
公開日: 2018-5-21
更新日: 2018-9-30
所属機関: Program in Applied and Computational Mathematics, Princeton University
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

透かし設計モデルの頑健性保証敵対的攻撃検出

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Deep neural networks have been demonstrated to be vulnerable to adversarial attacks, where small perturbations intentionally added to the original inputs can fool the classifier. In this paper, we propose a defense method, Featurized Bidirectional Generative Adversarial Networks (FBGAN), to extract the semantic features of the input and filter the non-semantic perturbation. FBGAN is pre-trained on the clean dataset in an unsupervised manner, adversarially learning a bidirectional mapping between the high-dimensional data space and the low-dimensional semantic space; also mutual information is applied to disentangle the semantically meaningful features. After the bidirectional mapping, the adversarial data can be reconstructed to denoised data, which could be fed into any pre-trained classifier. We empirically show the quality of reconstruction images and the effectiveness of defense.

外部データセット

MNIST

Fashion MNIST

SVHN