In a \emph{data poisoning attack}, an attacker modifies, deletes, and/or
inserts some training examples to corrupt the learned machine learning model.
\emph{Bootstrap Aggregating (bagging)} is a well-known ensemble learning
method that trains multiple base models on random subsamples of a training
dataset using a base learning algorithm and takes a majority vote among the
base models to predict the label of a testing example. We prove the intrinsic
certified robustness of
bagging against data poisoning attacks. Specifically, we show that bagging with
an arbitrary base learning algorithm provably predicts the same label for a
testing example when the number of modified, deleted, and/or inserted training
examples is bounded by a threshold. Moreover, we show that our derived
threshold is tight if no assumptions on the base learning algorithm are made.
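For concreteness, the following is a minimal Python sketch of the bagging
procedure described above, with subsamples drawn with replacement as in
classical bagging; it assumes scikit-learn-style base learners with
\texttt{fit}/\texttt{predict}, the function names and default values of the
number of base models and the subsample size are illustrative assumptions
rather than details of our released implementation, and the computation of the
certified threshold itself is omitted.
\begin{verbatim}
import numpy as np
from collections import Counter

def train_bagging(X, y, base_learner, n_models=100, k=50, seed=0):
    # Train n_models base models, each on k training examples
    # sampled uniformly at random with replacement.
    rng = np.random.default_rng(seed)
    models = []
    for _ in range(n_models):
        idx = rng.integers(0, len(X), size=k)
        models.append(base_learner(X[idx], y[idx]))
    return models

def bagging_predict(models, x):
    # Predict the label of one testing example by majority vote
    # among the base models' predicted labels.
    votes = Counter(m.predict(x.reshape(1, -1))[0] for m in models)
    return votes.most_common(1)[0][0]

# Illustrative usage with a scikit-learn base learner:
#   from sklearn.tree import DecisionTreeClassifier
#   models = train_bagging(X_train, y_train,
#       lambda Xs, ys: DecisionTreeClassifier().fit(Xs, ys))
#   label = bagging_predict(models, X_test[0])
\end{verbatim}
Intuitively, each base model sees only a small random subsample of the
training dataset, so a bounded number of poisoned training examples can
influence only a minority of the votes with high probability; our analysis
makes this intuition precise.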
We evaluate our method on MNIST and CIFAR-10. For instance, our method
achieves a certified accuracy of $91.1\%$ on MNIST when an attacker
arbitrarily modifies, deletes, and/or inserts at most 100 training examples.
Code is available at:
\url{https://github.com/jjy1994/BaggingCertifyDataPoisoning}.