Deep k-NN Defense against Clean-label Data Poisoning Attacks

arXiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1909.13374

PDF

https://arxiv.org/pdf/1909.13374

Bibliographic Information

Authors: Neehar Peri, Neal Gupta, W. Ronny Huang, Liam Fowl, Chen Zhu, Soheil Feizi, Tom Goldstein, John P. Dickerson
Published: 2019-9-30
Updated: 2020-8-13
Affiliation: Center for Machine Learning, University of Maryland - College Park
Country of affiliation: United States of America
Conference: ECCV Workshops

Labels Estimated by AI

Poisoned data detection, Performance evaluation, Backdoor attack

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".

Abstract

Targeted clean-label data poisoning is a type of adversarial attack on machine learning systems in which an adversary injects a few correctly-labeled, minimally-perturbed samples into the training data, causing a model to misclassify a particular test sample during inference. Although defenses have been proposed for general poisoning attacks, no reliable defense for clean-label attacks has been demonstrated, despite the attacks' effectiveness and realistic applications. In this work, we propose a simple, yet highly-effective Deep k-NN defense against both feature collision and convex polytope clean-label attacks on the CIFAR-10 dataset. We demonstrate that our proposed strategy is able to detect over 99% of poisoned examples in both attacks and remove them without compromising model performance. Additionally, through ablation studies, we discover simple guidelines for selecting the value of k as well as for implementing the Deep k-NN defense on real-world datasets with class imbalance. Our proposed defense shows that current clean-label poisoning attack strategies can be annulled, and serves as a strong yet simple-to-implement baseline defense to test future clean-label poisoning attacks. Our code is available at https://github.com/neeharperi/DeepKNNDefense
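The abstract does not spell out the filtering rule, so the following is only a minimal sketch of one plausible reading: a training sample is kept when its label agrees with the plurality label of its k nearest neighbors in feature space, and dropped otherwise. The function name `deep_knn_filter`, the toy 2-D "features", and the choice k = 3 are all illustrative assumptions; in practice the features would presumably come from a deep network's internal representation, and the authors' actual implementation is in the repository linked above.

```python
import math
from collections import Counter


def deep_knn_filter(features, labels, k):
    """Return indices of samples whose label matches the plurality label
    of their k nearest neighbors (Euclidean distance, self excluded).

    Hypothetical helper for illustration; not the authors' code.
    """
    kept = []
    n = len(features)
    for i in range(n):
        # Distance from sample i to every other sample, nearest first.
        dists = sorted(
            (math.dist(features[i], features[j]), j)
            for j in range(n)
            if j != i
        )
        neighbor_labels = [labels[j] for _, j in dists[:k]]
        plurality_label, _ = Counter(neighbor_labels).most_common(1)[0]
        if plurality_label == labels[i]:
            kept.append(i)
    return kept


# Toy stand-in for deep features (assumption): two well-separated clusters.
features = [
    (0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),   # class 0 cluster
    (5.0, 5.0), (5.1, 5.0), (5.0, 5.1), (5.1, 5.1),   # class 1 cluster
    (0.05, 0.05),  # "poison": labeled 1, but sits inside the class 0 cluster
]
labels = [0, 0, 0, 0, 1, 1, 1, 1, 1]

kept = deep_knn_filter(features, labels, k=3)
print(kept)  # the poison at index 8 is filtered out
```

In this toy setup the last point mimics a clean-label poison: it carries a class 1 label, yet its features collide with the class 0 cluster, so its neighbors outvote its label and it is removed while every clean sample survives. The brute-force neighbor search is quadratic in the number of samples and is meant only to keep the sketch self-contained.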

External Datasets

CIFAR-10