Data Poisoning against Differentially-Private Learners: Attacks and Defenses

Authors: Yuzhe Ma, Xiaojin Zhu, Justin Hsu | Published: 2019-03-23 | Updated: 2019-07-05



Source: https://arxiv.org/abs/1903.09860

PDF: https://arxiv.org/pdf/1903.09860

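AI-estimated labels

Adversarial attack detection; Detection of poison data for backdoor attacks; Untargeted poisoning attack

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".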

Abstract

Data poisoning attacks aim to manipulate the model produced by a learning algorithm by adversarially modifying the training set. We consider differential privacy as a defensive measure against this type of attack. We show that such learners are resistant to data poisoning attacks when the adversary is only able to poison a small number of items. However, this protection degrades as the adversary poisons more data. To illustrate, we design attack algorithms targeting objective and output perturbation learners, two standard approaches to differentially-private machine learning. Experiments show that our methods are effective when the attacker is allowed to poison sufficiently many training items.
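
The claim that protection holds for a few poisoned items but degrades as more are poisoned can be illustrated with the standard group-privacy argument. The block below is a sketch under stated assumptions, not the paper's own analysis. It assumes pure ε-differential privacy (the abstract does not say which variant is used), and the symbols M (randomized learner), D (clean training set), D̃ (poisoned training set), k (number of poisoned items), and C (a nonnegative attacker cost on the learned model) are introduced here for illustration only.

```latex
% Assumption: pure \varepsilon-differential privacy. All symbols are illustrative.
\begin{align*}
% Definition: D and D' differ in exactly one training item.
\Pr[\mathcal{M}(D) \in S] &\le e^{\varepsilon}\, \Pr[\mathcal{M}(D') \in S]
  && \text{for all measurable } S, \\
% Group privacy: apply the definition k times along a chain of
% neighbouring datasets from D to \tilde{D}.
\Pr[\mathcal{M}(D) \in S] &\le e^{k\varepsilon}\, \Pr[\mathcal{M}(\tilde{D}) \in S]
  && \text{if } \tilde{D} \text{ differs from } D \text{ in } k \text{ items}, \\
% Consequence for any nonnegative attacker cost C(\theta) \ge 0.
\mathbb{E}\big[C(\mathcal{M}(\tilde{D}))\big] &\ge e^{-k\varepsilon}\,
  \mathbb{E}\big[C(\mathcal{M}(D))\big].
\end{align*}
```

Under these assumptions, poisoning k items can lower the attacker's expected cost by at most a factor of e^{-kε}. For small k this factor is close to 1, so the attacker gains little. As k grows, the factor shrinks toward 0 and the guarantee becomes vacuous, which matches the degradation described in the abstract.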