Machine learning algorithms are vulnerable to poisoning attacks: an adversary
can inject malicious points into the training dataset to influence the learning
process and degrade the algorithm's performance. Optimal poisoning attacks,
which model the attack as a bi-level optimization problem, have already been
proposed to evaluate worst-case scenarios. However, solving these problems is
computationally demanding, which limits the applicability of such attacks to
some models, such as deep networks.
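For context, a sketch of the bi-level formulation that is standard in the poisoning literature (notation ours, not taken from this paper): the attacker chooses poisoning points $D_p$ that maximize the learner's loss on clean data, while the classifier parameters are those obtained by training on the poisoned set.

```latex
% Standard bi-level poisoning objective (notation assumed):
% D_p: poisoning points, D_tr: clean training set, D_val: clean validation set,
% \mathcal{L}: the learner's loss, \theta: classifier parameters.
\max_{D_p} \; \mathcal{L}\left(D_{val};\, \theta^{*}\right)
\quad \text{s.t.} \quad
\theta^{*} \in \arg\min_{\theta} \; \mathcal{L}\left(D_{tr} \cup D_p;\, \theta\right)
```

The inner problem (training the classifier) must in principle be re-solved for every candidate $D_p$, which is what makes these attacks expensive for models like deep networks.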
In this paper we introduce a novel generative model to craft systematic
poisoning attacks against machine learning classifiers by generating
adversarial training examples, i.e., samples that look like genuine data points
but degrade the classifier's accuracy when used for training. We propose a
Generative Adversarial Net with three components: a generator, a discriminator,
and the target classifier. This approach allows us to naturally model the
detectability constraints that can be expected in realistic attacks and to
identify the regions of the underlying data distribution that are more
vulnerable to data poisoning. Our experimental evaluation shows the
effectiveness of our attack at compromising machine learning classifiers,
including deep networks.
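To make the three-player setup concrete, here is a minimal, hypothetical sketch in PyTorch; it is not the authors' implementation, and the architectures, losses, toy data, and the trade-off weight `lam` are all assumptions. The discriminator enforces that poisoned points look genuine (the detectability constraint), the target classifier takes training steps on genuine plus poisoned data, and the generator tries to fool the discriminator while increasing the classifier's loss on the generated points.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
dim, n, batch = 2, 256, 64

# Toy "genuine" data: two Gaussian blobs with binary labels (assumed).
x_real = torch.cat([torch.randn(n, dim) - 2.0, torch.randn(n, dim) + 2.0])
y_real = torch.cat([torch.zeros(n), torch.ones(n)]).long()

G = nn.Sequential(nn.Linear(dim, 32), nn.ReLU(), nn.Linear(32, dim))  # generator
D = nn.Sequential(nn.Linear(dim, 32), nn.ReLU(), nn.Linear(32, 1))    # discriminator
C = nn.Sequential(nn.Linear(dim, 32), nn.ReLU(), nn.Linear(32, 2))    # target classifier

opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
opt_c = torch.optim.Adam(C.parameters(), lr=1e-3)
bce, ce = nn.BCEWithLogitsLoss(), nn.CrossEntropyLoss()
lam = 0.5                            # detectability/damage trade-off (assumed)
y_poison = torch.ones(batch).long()  # attacker-chosen labels for the poison

for step in range(2000):
    x_fake = G(torch.randn(batch, dim))

    # 1) Discriminator: genuine vs. generated (the detectability constraint).
    opt_d.zero_grad()
    d_loss = (bce(D(x_real), torch.ones(2 * n, 1))
              + bce(D(x_fake.detach()), torch.zeros(batch, 1)))
    d_loss.backward()
    opt_d.step()

    # 2) Target classifier: one training step on genuine plus poisoned data.
    opt_c.zero_grad()
    c_loss = ce(C(x_real), y_real) + ce(C(x_fake.detach()), y_poison)
    c_loss.backward()
    opt_c.step()

    # 3) Generator: look genuine to D while maximizing the classifier's
    #    loss on the generated (poisoned) points.
    opt_g.zero_grad()
    g_loss = (lam * bce(D(x_fake), torch.ones(batch, 1))
              - (1 - lam) * ce(C(x_fake), y_poison))
    g_loss.backward()
    opt_g.step()
```

The alternating single-step updates stand in for solving the inner training problem of the bi-level formulation to convergence at every attacker step, which is what would keep an attack of this kind tractable for deep networks.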