PuVAE: A Variational Autoencoder to Purify Adversarial Examples

arXiv

AI Security Portal bot

The information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1903.00585

PDF

https://arxiv.org/pdf/1903.00585

Paper information

Authors: Uiwon Hwang, Jaewoo Park, Hyemi Jang, Sungroh Yoon, Nam Ik Cho
Published: 2019-03-02
Affiliation: Department of Electrical and Computer Engineering, Seoul National University
Country: Korea
Venue: IEEE Access

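Labels estimated by AI

Robustness improvement method, Poisoning, Adversarial perturbation method

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".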

Abstract

Deep neural networks are widely used and exhibit excellent performance in many areas. However, they are vulnerable to adversarial attacks that compromise the network at inference time by applying elaborately designed perturbations to the input data. Although several defense methods have been proposed to address specific attacks, other attack methods can circumvent these defense mechanisms. Therefore, we propose the Purifying Variational Autoencoder (PuVAE), a method to purify adversarial examples. The proposed method eliminates an adversarial perturbation by projecting an adversarial example onto the manifold of each class and selecting the closest projection as the purified sample. We experimentally illustrate the robustness of PuVAE against various attack methods without any prior knowledge of the attacks. In our experiments, the proposed method exhibits performance competitive with state-of-the-art defense methods, and its inference is approximately 130 times faster than that of Defense-GAN, the state-of-the-art purifier model.
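
The selection step described in the abstract (project the input onto each class manifold, then keep the closest projection) can be illustrated with a small toy sketch. This is not the PuVAE implementation: as a stand-in for the class-conditional VAE, each class "manifold" is assumed here to be a linear subspace with an orthonormal basis, and closeness is assumed to be Euclidean distance. The names `project_onto_class`, `purify`, and `class_bases` are hypothetical and do not come from the paper.

```python
import numpy as np


def project_onto_class(x, basis):
    """Orthogonally project x onto the span of the columns of `basis`.

    Assumption: `basis` has orthonormal columns. In PuVAE this role is
    played by a class-conditional VAE reconstruction, not a linear map.
    """
    return basis @ (basis.T @ x)


def purify(x, class_bases):
    """Return the projection closest to x and the index of its class."""
    candidates = [project_onto_class(x, basis) for basis in class_bases]
    # Assumption: Euclidean distance decides which projection is closest.
    distances = [np.linalg.norm(x - candidate) for candidate in candidates]
    best = int(np.argmin(distances))
    return candidates[best], best


# Toy "manifolds" in 3-D: class 0 is the x-axis, class 1 is the y-axis.
class_bases = [
    np.array([[1.0], [0.0], [0.0]]),
    np.array([[0.0], [1.0], [0.0]]),
]

clean = np.array([2.0, 0.0, 0.0])         # lies on the class-0 manifold
perturbation = np.array([0.0, 0.3, 0.2])  # small off-manifold perturbation
adversarial = clean + perturbation

purified, chosen_class = purify(adversarial, class_bases)
```

In this toy setting the perturbation is orthogonal to the class-0 subspace, so the projection onto class 0 removes it entirely and `purified` equals `clean`. A learned, nonlinear manifold would generally recover the clean input only approximately.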