Abstract
Deep neural networks (DNNs) have proven to be powerful tools for processing
unstructured data. However, for high-dimensional data such as images, they are
inherently vulnerable to adversarial attacks: small, almost invisible
perturbations added to the input can fool a DNN. Various attacks, hardening
methods, and detection methods have been introduced in recent years.
Notoriously, Carlini-Wagner (CW) type attacks, computed by iterative
minimization, are among the most difficult to detect. In this work we
outline a mathematical proof that the CW attack can be used as a detector
itself. That is, under certain assumptions and in the limit of attack
iterations, this detector provides asymptotically optimal separation of
original and attacked images. We validate this statement in numerical
experiments and obtain AUROC values of up to 99.73% on CIFAR10 and ImageNet,
placing the detector in the upper range of current state-of-the-art
detection rates for CW attacks.
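
The following is a minimal sketch (not the authors' implementation) of the core idea as stated in the abstract: re-running a CW-style L2 attack on an input and using the size of the minimal perturbation found as a detection score, the intuition being that already-attacked images sit closer to the decision boundary and therefore need a smaller perturbation to flip the prediction. The classifier `model`, the hyperparameters (`steps`, `lr`, `c`), and the simplified margin loss are all illustrative assumptions, not details from the paper.

```python
# Sketch: CW-style attack as its own detector (illustrative, hedged).
import torch
import torch.nn.functional as F
from sklearn.metrics import roc_auc_score


def cw_l2_score(model, x, steps=100, lr=0.01, c=1.0):
    """Run a simplified CW-style L2 attack on a batch x and return the
    final perturbation norm per image, used here as the detection score."""
    model.eval()
    y = model(x).argmax(dim=1)  # current prediction to attack away from
    delta = torch.zeros_like(x, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    for _ in range(steps):
        logits = model(x + delta)
        onehot = F.one_hot(y, logits.shape[1]).bool()
        current = logits[onehot]  # logit of the currently predicted class
        other = logits.masked_fill(onehot, -1e9).max(dim=1).values
        # CW-style margin loss: push the current class below the best other
        # class while keeping the L2 perturbation small.
        margin = torch.clamp(current - other, min=0.0)
        loss = (delta.flatten(1).norm(dim=1) ** 2 + c * margin).sum()
        opt.zero_grad()
        loss.backward()
        opt.step()
    # Small norm => input was likely adversarial already.
    return delta.detach().flatten(1).norm(dim=1)


# Usage sketch: compare scores on clean vs. previously attacked images.
# A *lower* score indicates an attacked input, hence the sign flip so that
# higher values correspond to the positive (attacked) class for AUROC.
#
# scores_clean = cw_l2_score(model, x_clean)
# scores_adv   = cw_l2_score(model, x_adv)
# labels = torch.cat([torch.zeros_like(scores_clean),
#                     torch.ones_like(scores_adv)])
# scores = -torch.cat([scores_clean, scores_adv])
# print("AUROC:", roc_auc_score(labels.numpy(), scores.numpy()))
```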