Manifold Regularization for Locally Stable Deep Neural Networks


arXiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2003.04286

PDF

https://arxiv.org/pdf/2003.04286

Paper information

Authors: Charles Jin, Martin Rinard
Published: 2020-3-10
Updated: 2020-9-23
Affiliation: CSAIL
Country of affiliation: United States of America
Conference:

Labels estimated by AI

Robustness, Adversarial examples, Training methods

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the literature database".

Abstract

We apply concepts from manifold regularization to develop new regularization techniques for training locally stable deep neural networks. Our regularizers are based on a sparsification of the graph Laplacian which holds with high probability when the data is sparse in high dimensions, as is common in deep learning. Empirically, our networks exhibit stability in a diverse set of perturbation models, including $\ell_2$, $\ell_\infty$, and Wasserstein-based perturbations; in particular, we achieve 40% adversarial accuracy on CIFAR-10 against an adaptive PGD attack using $\ell_\infty$ perturbations of size $\epsilon = 8/255$, and state-of-the-art verified accuracy of 21% in the same perturbation model. Furthermore, our techniques are efficient, incurring overhead on par with two additional parallel forward passes through the network.
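For readers unfamiliar with the term, the sketch below illustrates the classical graph-Laplacian penalty that manifold regularization builds on. It is not the sparsified regularizer proposed in the paper, whose details are not given in this record. The function names, the Gaussian affinity, the bandwidth `sigma`, and the random toy data are all assumptions made for illustration, and plain Python is used only to keep the example self-contained. The code checks the standard identity $\frac{1}{2}\sum_{i,j} W_{ij}\,\lVert f(x_i) - f(x_j)\rVert^2 = \mathrm{tr}(F^\top L F)$ with $L = D - W$.

```python
import math
import random


def gaussian_affinity(xs, sigma=1.0):
    """Dense affinity W_ij = exp(-||x_i - x_j||^2 / (2 sigma^2)), zero diagonal.

    The Gaussian kernel and sigma are illustrative assumptions.
    """
    n = len(xs)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(xs[i], xs[j]))
                w[i][j] = math.exp(-d2 / (2.0 * sigma ** 2))
    return w


def pairwise_penalty(fs, w):
    """Return 0.5 * sum_ij W_ij * ||f_i - f_j||^2.

    Small when outputs of nearby (high-affinity) inputs agree.
    """
    n = len(fs)
    total = 0.0
    for i in range(n):
        for j in range(n):
            d2 = sum((a - b) ** 2 for a, b in zip(fs[i], fs[j]))
            total += w[i][j] * d2
    return 0.5 * total


def laplacian_penalty(fs, w):
    """Return tr(F^T L F) = sum_ij L_ij <f_i, f_j>, with L = D - W."""
    n = len(fs)
    total = 0.0
    for i in range(n):
        degree = sum(w[i])  # D_ii, the row sum of W
        for j in range(n):
            l_ij = (degree if i == j else 0.0) - w[i][j]
            total += l_ij * sum(a * b for a, b in zip(fs[i], fs[j]))
    return total


# Toy data: 6 hypothetical inputs in 4 dimensions, 3 outputs per input.
random.seed(0)
xs = [[random.gauss(0.0, 1.0) for _ in range(4)] for _ in range(6)]
fs = [[random.gauss(0.0, 1.0) for _ in range(3)] for _ in range(6)]
w = gaussian_affinity(xs)

print(pairwise_penalty(fs, w), laplacian_penalty(fs, w))
```

The two printed values agree up to floating-point error, and the penalty vanishes when every input is mapped to the same output. In training, a term of this kind would typically be added to the task loss with a weighting coefficient; how the paper sparsifies the Laplacian and chooses neighbors is described in the source PDF.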

External datasets

CIFAR-10