Abstract
An adversarial example (AE) is an attack on machine learning, crafted by adding an imperceptible perturbation to the data to induce misclassification. In this paper, we investigated the upper bound of the probability of successful AEs based on Gaussian Process (GP) classification, a probabilistic inference model. We proved a new upper bound on the probability of a successful AE attack that depends on the norm of the AE's perturbation, the kernel function used in the GP, and the distance between the closest pair of training points with different labels. Surprisingly, the upper bound holds regardless of the distribution of the sample dataset. We confirmed our theoretical result through experiments on ImageNet. In addition, we showed that changing the parameters of the kernel function changes the upper bound on the probability of successful AEs.
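The bound described above ties together three measurable quantities: the norm of the perturbation, the kernel evaluated across that perturbation, and the minimum distance between differently labeled training points. As a rough illustration only (the paper's actual bound is not reproduced here), the following Python sketch computes these three ingredients on a hypothetical toy dataset, using an RBF kernel whose `length_scale` stands in for the kernel parameter mentioned in the final sentence; all names and data are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Illustrative sketch only: computes the three quantities the abstract says
# the proved upper bound depends on. The bound itself is NOT reproduced here.

def rbf_kernel(x, y, length_scale=1.0):
    """RBF (squared-exponential) kernel, a common choice in GP classification."""
    return np.exp(-np.sum((x - y) ** 2) / (2.0 * length_scale ** 2))

def min_cross_label_distance(X, labels):
    """Distance of the closest pair of training points with different labels."""
    return min(
        np.linalg.norm(X[i] - X[j])
        for i in range(len(X))
        for j in range(i + 1, len(X))
        if labels[i] != labels[j]
    )

# Hypothetical toy training set (for illustration only).
X = np.array([[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]])
labels = np.array([0, 0, 1, 1])

delta = np.array([0.05, -0.03])            # adversarial perturbation
perturbation_norm = np.linalg.norm(delta)  # ||delta||, enters the bound
d_min = min_cross_label_distance(X, labels)  # closest cross-label distance
k_val = rbf_kernel(X[0], X[0] + delta)     # kernel evaluated across the perturbation

print(f"perturbation norm:          {perturbation_norm:.4f}")
print(f"min cross-label distance:   {d_min:.4f}")
print(f"kernel value k(x, x+delta): {k_val:.4f}")
```

Shrinking `length_scale` makes the kernel decay faster with distance, which is the kind of kernel-parameter change the abstract says shifts the upper bound.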