Abstract
Deep neural networks (DNNs) have been shown to be vulnerable to adversarial
examples (AEs), which are maliciously designed to cause dramatic model output
errors. In this work, we reveal that normal examples (NEs) are insensitive to
fluctuations occurring in highly-curved regions of the decision boundary,
whereas AEs, typically crafted in a single domain (mostly the spatial domain),
exhibit extreme sensitivity to such fluctuations. This phenomenon motivates us
to design another classifier (called the dual classifier) with a transformed
decision boundary, which can be used jointly with the original classifier
(called the primal classifier) to detect AEs by virtue of the
sensitivity inconsistency. Compared with state-of-the-art algorithms based on
Local Intrinsic Dimensionality (LID), Mahalanobis Distance (MD), and Feature
Squeezing (FS), our proposed Sensitivity Inconsistency Detector (SID) achieves
improved AE detection performance and superior generalization capability,
especially in the challenging cases where the adversarial perturbation level is
small. Extensive experimental results on ResNet and VGG validate the
superiority of the proposed SID.
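The detection principle can be illustrated with a toy sketch. Everything below is an illustrative assumption rather than the paper's actual construction: the paper's dual classifier is obtained by transforming the decision boundary of a trained DNN, whereas here two hand-picked linear boundaries stand in for the primal and dual classifiers, and the perturbation scale and score are arbitrary. The sketch only shows the core intuition: an AE sits just across the primal boundary, so small random fluctuations flip the primal prediction far more often than the dual one, while a NE far from both boundaries is stable under both.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy primal/dual classifiers: two linear boundaries (hypothetical stand-ins
# for the original classifier and the one with a transformed decision boundary).
w_primal = np.array([1.0, 0.0])   # primal decision boundary: x1 = 0
w_dual   = np.array([0.0, 1.0])   # "transformed" boundary:   x2 = 0

def flip_rate(w, x, eps=0.1, n=200):
    """Fraction of small random fluctuations that flip the predicted label."""
    noise = eps * rng.standard_normal((n, x.size))
    return float(np.mean(np.sign((x + noise) @ w) != np.sign(x @ w)))

def sid_score(x):
    """Sensitivity inconsistency between the primal and dual classifiers."""
    return abs(flip_rate(w_primal, x) - flip_rate(w_dual, x))

ne = np.array([2.0, 2.0])    # normal example: far from both boundaries
ae = np.array([0.01, 2.0])   # adversarial example: barely across the primal boundary

# NEs show consistently low sensitivity under both classifiers; AEs show a
# large sensitivity gap, which a simple threshold can flag.
print(f"NE score: {sid_score(ne):.2f}, AE score: {sid_score(ae):.2f}")
```

Thresholding `sid_score` then yields a detector: examples whose primal and dual sensitivities disagree strongly are flagged as adversarial.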