Machine learning systems are deployed in critical settings, but they can
fail in unexpected ways, degrading the accuracy of their predictions. Poisoning
attacks against machine learning adversarially modify the data used by a
learning algorithm in order to selectively change its output once the model is
deployed. In this work, we introduce a novel data poisoning attack called a
\emph{subpopulation attack}, which is particularly relevant when datasets are
large and diverse. We design a modular framework for subpopulation attacks,
instantiate it with different building blocks, and show that the attacks are
effective across a variety of datasets and machine learning models. We further
optimize the attacks in continuous domains using influence functions and
gradient optimization methods. Unlike existing backdoor poisoning attacks,
which must modify inputs at inference time to trigger misclassification,
subpopulation attacks induce misclassification on naturally distributed data
points, making them extremely stealthy. We also show that our attack strategy
can be used to
improve upon existing targeted attacks. We prove that, under some assumptions,
subpopulation attacks are impossible to defend against, and empirically
demonstrate the limitations of existing defenses against our attacks,
highlighting the difficulty of protecting machine learning systems against this threat.
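
For readers unfamiliar with the influence-function machinery mentioned above, a minimal sketch follows, assuming the standard formulation of Koh and Liang rather than this paper's exact attack objective: the influence of upweighting a training point $z$ on the model's loss at a test point $z_{\text{test}}$ is
\[
\mathcal{I}(z, z_{\text{test}}) = -\nabla_\theta \ell(z_{\text{test}}, \hat{\theta})^\top H_{\hat{\theta}}^{-1} \nabla_\theta \ell(z, \hat{\theta}),
\qquad
H_{\hat{\theta}} = \frac{1}{n} \sum_{i=1}^{n} \nabla_\theta^2 \ell(z_i, \hat{\theta}),
\]
where $\hat{\theta}$ is the empirical risk minimizer and $\ell$ the training loss. A poisoning attacker can, in principle, use this quantity to select or perturb poison points that maximally increase the loss on a targeted subpopulation.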