Poison is Not Traceless: Fully-Agnostic Detection of Poisoning Attacks

TOP Literature Database Poison is Not Traceless: Fully-Agnostic Detection of Poisoning Attacks

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2310.16224

PDF

https://arxiv.org/pdf/2310.16224

Paper Information

Author: Xinglong Chang;Katharina Dost;Gillian Dobbie;Jörg Wicker
Published: 10-25-2023
Affiliation: The University of Auckland
Country: New Zealand
Conference: Computing Research Repository (CoRR)

Labels Estimated by AI

Poisoning Adversarial Attack Detection Data Generation

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

The performance of machine learning models depends on the quality of the underlying data. Malicious actors can attack the model by poisoning the training data. Current detectors are tied to either specific data types, models, or attacks, and therefore have limited applicability in real-world scenarios. This paper presents a novel fully-agnostic framework, DIVA (Detecting InVisible Attacks), that detects attacks solely relying on analyzing the potentially poisoned data set. DIVA is based on the idea that poisoning attacks can be detected by comparing the classifier's accuracy on poisoned and clean data and pre-trains a meta-learner using Complexity Measures to estimate the otherwise unknown accuracy on a hypothetical clean dataset. The framework applies to generic poisoning attacks. For evaluation purposes, in this paper, we test DIVA on label-flipping attacks.

External Datasets

Banknote

Breastcancer

HTRU2

Ringnorm

Texture

Abalone

Australian

CMC

Phoneme

Yeast