Localized Uncertainty Attacks

TOP Literature Database Localized Uncertainty Attacks

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2106.09222

PDF

https://arxiv.org/pdf/2106.09222

Paper Information

Author: Ousmane Amadou Dia;Theofanis Karaletsos;Caner Hazirbas;Cristian Canton Ferrer;Ilknur Kaynar Kabul;Erik Meijer
Published: 6-17-2021
Affiliation: Facebook
Country: United States of America
Conference: Computing Research Repository (CoRR)

Labels Estimated by AI

Adversarial Example Uncertainty Estimation Cyber Attack

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

The susceptibility of deep learning models to adversarial perturbations has stirred renewed attention in adversarial examples resulting in a number of attacks. However, most of these attacks fail to encompass a large spectrum of adversarial perturbations that are imperceptible to humans. In this paper, we present localized uncertainty attacks, a novel class of threat models against deterministic and stochastic classifiers. Under this threat model, we create adversarial examples by perturbing only regions in the inputs where a classifier is uncertain. To find such regions, we utilize the predictive uncertainty of the classifier when the classifier is stochastic or, we learn a surrogate model to amortize the uncertainty when it is deterministic. Unlike $\ell_p$ ball or functional attacks which perturb inputs indiscriminately, our targeted changes can be less perceptible. When considered under our threat model, these attacks still produce strong adversarial examples; with the examples retaining a greater degree of similarity with the inputs.

External Datasets

CIFAR-10

MNIST

STL-10

ImageNet