We establish a theoretical link between adversarial training and operator
norm regularization for deep neural networks. Specifically, we prove that
adversarial training based on $\ell_p$-norm constrained projected gradient ascent,
with an $\ell_q$-norm loss on the logits of clean and perturbed inputs, is
equivalent to data-dependent $(p, q)$ operator norm regularization. This
fundamental connection confirms the long-standing argument that a network's
sensitivity to adversarial examples is tied to its spectral properties and
hints at novel ways to robustify networks and defend against adversarial attacks. We
provide extensive empirical evidence on state-of-the-art network architectures
to support our theoretical results.
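To make the claimed equivalence concrete, the display below sketches the first-order argument behind it; the notation is not fixed in this abstract and is introduced here only for illustration, with $f$ denoting the logit map, $J_f(x)$ its Jacobian at a clean input $x$, and $\epsilon$ the radius of the $\ell_p$-norm perturbation constraint:
\[
  \max_{\|\delta\|_p \le \epsilon} \big\| f(x+\delta) - f(x) \big\|_q
  \;\approx\; \max_{\|\delta\|_p \le \epsilon} \big\| J_f(x)\,\delta \big\|_q
  \;=\; \epsilon \max_{\|v\|_p \le 1} \big\| J_f(x)\,v \big\|_q
  \;=\; \epsilon \,\big\| J_f(x) \big\|_{p,q}.
\]
In words, the inner maximization of projected gradient ascent, applied to the $\ell_q$-norm difference between clean and perturbed logits, evaluates (to first order) the data-dependent $(p, q)$ operator norm of the logit Jacobian at $x$.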