We introduce a Noise-based prior Learning (NoL) approach for training neural
networks that are intrinsically robust to adversarial attacks. We find that
implicit generative modeling of random noise with the same loss function used
during posterior maximization improves a model's understanding of the data
manifold, thereby furthering adversarial robustness. We evaluate our approach's
efficacy and provide a simple visualization tool, based on Principal Component
Analysis, for understanding adversarial data. Our analysis reveals that adversarial
robustness, in general, manifests in models with higher variance along the
high-ranked principal components. We show that models learnt with our approach
perform remarkably well against a wide range of attacks. Furthermore, combining
NoL with state-of-the-art adversarial training extends a model's robustness even
beyond the attacks it is explicitly trained against, in both white-box and
black-box attack scenarios.
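
The abstract does not spell out NoL's training mechanics, so the following is only a minimal sketch of one plausible reading: a learnable noise tensor is coupled to each input and updated by the same cross-entropy loss that drives the weight (posterior) updates. The architecture, the additive coupling, and every name below (noise_prior, nol_step, the hyperparameters) are illustrative assumptions, not the paper's specification.

import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical classifier; any network works here, this one is purely illustrative.
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 256), nn.ReLU(), nn.Linear(256, 10))

# Learnable noise "prior": initialized at random and updated by the same loss
# that drives posterior (weight) maximization, rather than by a separate generative loss.
noise_prior = torch.randn(1, 1, 28, 28, requires_grad=True)

opt = torch.optim.SGD(list(model.parameters()) + [noise_prior], lr=0.1)

def nol_step(x, y):
    """One training step: the classification loss backpropagates into both
    the network weights and the noise tensor."""
    opt.zero_grad()
    logits = model(x + noise_prior)       # couple the learned noise to the input (assumed coupling)
    loss = F.cross_entropy(logits, y)     # same loss as ordinary posterior maximization
    loss.backward()
    opt.step()
    return loss.item()

# Usage with a dummy batch
x = torch.rand(32, 1, 28, 28)
y = torch.randint(0, 10, (32,))
print(nol_step(x, y))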
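
Likewise, the PCA-based analysis is only named here, not specified; below is a minimal sketch of how the reported quantity (variance carried by the high-ranked principal components) could be inspected, assuming the projected data are penultimate-layer features collected from a trained model. The features array and the component count are placeholders.

import numpy as np
from sklearn.decomposition import PCA

# Placeholder: in practice these would be penultimate-layer activations
# gathered over a held-out set; random data stands in here.
features = np.random.randn(1000, 256)

# Fit PCA and inspect how much variance the high-ranked (leading) components carry.
pca = PCA(n_components=20)
pca.fit(features)
print("explained variance ratio of top components:", pca.explained_variance_ratio_)
print("cumulative variance in top 20 components:", pca.explained_variance_ratio_.sum())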