Manifold Regularization for Locally Stable Deep Neural Networks

TOP Literature Database Manifold Regularization for Locally Stable Deep Neural Networks

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2003.04286

PDF

https://arxiv.org/pdf/2003.04286

Paper Information

Author: Charles Jin,Martin Rinard
Published: 3-10-2020
Updated: 9-23-2020
Affiliation: CSAIL
Country: United States of America
Conference

Labels Estimated by AI

Robustness Adversarial Example Training Method

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

We apply concepts from manifold regularization to develop new regularization techniques for training locally stable deep neural networks. Our regularizers are based on a sparsification of the graph Laplacian which holds with high probability when the data is sparse in high dimensions, as is common in deep learning. Empirically, our networks exhibit stability in a diverse set of perturbation models, including $\ell_2$, $\ell_\infty$, and Wasserstein-based perturbations; in particular, we achieve 40% adversarial accuracy on CIFAR-10 against an adaptive PGD attack using $\ell_\infty$ perturbations of size $\epsilon = 8/255$, and state-of-the-art verified accuracy of 21% in the same perturbation model. Furthermore, our techniques are efficient, incurring overhead on par with two additional parallel forward passes through the network.

External Datasets

CIFAR-10