Poisoning Web-Scale Training Datasets is Practical

TOP Literature Database Poisoning Web-Scale Training Datasets is Practical

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2302.10149

PDF

https://arxiv.org/pdf/2302.10149

Paper Information

Author: Nicholas Carlini,Matthew Jagielski,Christopher A. Choquette-Choo,Daniel Paleka,Will Pearce,Hyrum Anderson,Andreas Terzis,Kurt Thomas,Florian Tramèr
Published: 2-21-2023
Updated: 5-6-2024
Affiliation: Google DeepMind
Country: United States of America
Conference: SP

Labels Estimated by AI

Attack Scenario Analysis Poisoning Adversarial attack

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Deep learning models are often trained on distributed, web-scale datasets crawled from the internet. In this paper, we introduce two new dataset poisoning attacks that intentionally introduce malicious examples to a model's performance. Our attacks are immediately practical and could, today, poison 10 popular datasets. Our first attack, split-view poisoning, exploits the mutable nature of internet content to ensure a dataset annotator's initial view of the dataset differs from the view downloaded by subsequent clients. By exploiting specific invalid trust assumptions, we show how we could have poisoned 0.01% of the LAION-400M or COYO-700M datasets for just $60 USD. Our second attack, frontrunning poisoning, targets web-scale datasets that periodically snapshot crowd-sourced content -- such as Wikipedia -- where an attacker only needs a time-limited window to inject malicious examples. In light of both attacks, we notify the maintainers of each affected dataset and recommended several low-overhead defenses.

External Datasets

LAION-400M

COYO-700M

Wikipedia

Common Crawl