Provably robust deep generative models

TOP 文献データベース Provably robust deep generative models

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2004.10608

PDF

https://arxiv.org/pdf/2004.10608

文献情報

作者: Filipe Condessa,Zico Kolter
公開日: 2020-4-22
所属機関: Bosch Center for Artificial Intelligence
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

堅牢性向上手法敵対的攻撃深層学習手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Recent work in adversarial attacks has developed provably robust methods for training deep neural network classifiers. However, although they are often mentioned in the context of robustness, deep generative models themselves have received relatively little attention in terms of formally analyzing their robustness properties. In this paper, we propose a method for training provably robust generative models, specifically a provably robust version of the variational auto-encoder (VAE). To do so, we first formally define a (certifiably) robust lower bound on the variational lower bound of the likelihood, and then show how this bound can be optimized during training to produce a robust VAE. We evaluate the method on simple examples, and show that it is able to produce generative models that are substantially more robust to adversarial attacks (i.e., an adversary trying to perturb inputs so as to drastically lower their likelihood under the model).