We introduce an approach for training Variational Autoencoders (VAEs) that
are certifiably robust to adversarial attack. Specifically, we first derive
actionable bounds on the minimal size of an input perturbation required to
change a VAE's reconstruction by more than an allowed amount, with these bounds
depending on key parameters such as the Lipschitz constants of the encoder and
decoder.
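To give a sense of these bounds, consider a simplified sketch that ignores the
encoder's stochasticity (the inequality below is illustrative, not the exact
bound we derive): if the encoder mean map $\mu$ is $L_\mu$-Lipschitz and the
decoder $g$ is $L_g$-Lipschitz, then
\[
\lVert g(\mu(x + \delta)) - g(\mu(x)) \rVert \le L_g L_\mu \lVert \delta \rVert,
\]
so any perturbation $\delta$ that changes the reconstruction by more than $r$
must satisfy $\lVert \delta \rVert \ge r / (L_g L_\mu)$; this is why the
Lipschitz constants are the key quantities to control.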
We then show how these parameters can be controlled, thereby providing a
mechanism to ensure \textit{a priori} that a VAE will attain a desired level
of robustness. Moreover, we extend this to a complete practical approach for
training such VAEs that ensures these criteria are met.
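As an illustrative sketch of how such constants can be controlled in practice,
one standard technique (not necessarily the exact mechanism of our procedure)
is spectral normalization of each layer's weights; in the PyTorch snippet
below, \texttt{lipschitz\_mlp} and \texttt{Scale} are hypothetical helper
names used to bound the Lipschitz constants of simple MLP encoder and decoder
networks:
\begin{verbatim}
# Illustrative sketch only: spectral normalization is one standard way
# to bound a network's Lipschitz constant; names here are hypothetical.
import torch.nn as nn
from torch.nn.utils import spectral_norm

class Scale(nn.Module):
    # Multiplies its input by a fixed constant c; this map is c-Lipschitz.
    def __init__(self, c):
        super().__init__()
        self.c = c

    def forward(self, x):
        return self.c * x

def lipschitz_mlp(sizes, lip_const=1.0):
    # Each spectrally normalized linear layer has spectral norm close to 1
    # (enforced approximately via power iteration), and ReLU is 1-Lipschitz,
    # so their composition is approximately 1-Lipschitz; a final fixed
    # scaling then bounds the overall Lipschitz constant by lip_const.
    layers = []
    for i in range(len(sizes) - 1):
        layers.append(spectral_norm(nn.Linear(sizes[i], sizes[i + 1])))
        if i < len(sizes) - 2:
            layers.append(nn.ReLU())
    layers.append(Scale(lip_const))
    return nn.Sequential(*layers)

# Hypothetical usage: an encoder mean network with Lipschitz constant
# of roughly at most 2 and a decoder with constant roughly at most 5.
encoder_mu = lipschitz_mlp([784, 256, 32], lip_const=2.0)
decoder = lipschitz_mlp([32, 256, 784], lip_const=5.0)
\end{verbatim}
Since the product of per-layer constants only upper-bounds the network's true
Lipschitz constant, such constructions are typically conservative, trading
some expressiveness for the guarantee.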
Critically, our method allows one to specify a desired level of robustness
\emph{upfront} and then train a VAE that is guaranteed to achieve it. We
further demonstrate that these Lipschitz-constrained VAEs are more robust to
attack than standard VAEs in practice.