Active Learning Under Malicious Mislabeling and Poisoning Attacks

arXiv

AI Security Portal bot

The information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2101.00157

PDF

https://arxiv.org/pdf/2101.00157

Bibliographic information

Authors: Jing Lin; Ryan Luley; Kaiqi Xiong
Published: 2021-1-1
Updated: 2021-9-2
Affiliation: ICNS Lab and Cyber Florida, University of South Florida
Country of affiliation: United States of America
Conference: Global Communications Conference (GLOBECOM)

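Labels estimated by AI

Poisoning; Performance evaluation; Backdoor attack

Note: These labels were added automatically by AI, so they may not be accurate.
For details, see "About the literature database".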

Abstract

Deep neural networks usually require large labeled datasets for training to achieve state-of-the-art performance in many tasks, such as image classification and natural language processing. Although a lot of data is created each day by active Internet users, most of these data are unlabeled and are vulnerable to data poisoning attacks. In this paper, we develop an efficient active learning method that requires fewer labeled instances and incorporates the technique of adversarial retraining in which additional labeled artificial data are generated without increasing the budget of the labeling. The generated adversarial examples also provide a way to measure the vulnerability of the model. To check the performance of the proposed method under an adversarial setting, i.e., malicious mislabeling and data poisoning attacks, we perform an extensive evaluation on the reduced CIFAR-10 dataset, which contains only two classes: airplane and frog. Our experimental results demonstrate that the proposed active learning method is efficient for defending against malicious mislabeling and data poisoning attacks. Specifically, whereas the baseline active learning method based on the random sampling strategy performs poorly (about 50%) under a malicious mislabeling attack, the proposed active learning method can achieve the desired accuracy of 89% using only one-third of the dataset on average.
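
The setting described in the abstract (a limited labeling budget, a random-sampling baseline, and an annotator who maliciously flips labels) can be illustrated with a small sketch. This is not the authors' method: the abstract does not state the query strategy, so uncertainty sampling is swapped in here as a common choice, the adversarial retraining step is omitted, and a tiny logistic regression on synthetic 2-D points stands in for a deep network on the reduced CIFAR-10 data. All names (`make_two_class_data`, `noisy_oracle`, `active_learn`, `flip_prob`, `budget`, `batch`) are hypothetical and chosen only for illustration.

```python
import math
import random


def make_two_class_data(n, rng):
    """Synthetic 2-D stand-in for a two-class dataset (assumption: not CIFAR-10)."""
    X, y = [], []
    for _ in range(n):
        label = rng.randint(0, 1)
        center = 1.5 if label == 1 else -1.5
        X.append((rng.gauss(center, 1.0), rng.gauss(center, 1.0)))
        y.append(label)
    return X, y


def sigmoid(z):
    # Clamp the logit so math.exp cannot overflow.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(w, x):
    return sigmoid(w[0] * x[0] + w[1] * x[1] + w[2])


def fit_logreg(X, y, steps=200, lr=0.5):
    """Gradient-descent logistic regression; returns [w1, w2, bias]."""
    w = [0.0, 0.0, 0.0]
    for _ in range(steps):
        g = [0.0, 0.0, 0.0]
        for x, t in zip(X, y):
            err = predict_proba(w, x) - t
            g[0] += err * x[0]
            g[1] += err * x[1]
            g[2] += err
        w = [wi - lr * gi / len(X) for wi, gi in zip(w, g)]
    return w


def noisy_oracle(true_label, flip_prob, rng):
    """Hypothetical malicious annotator: flips the label with probability flip_prob."""
    return 1 - true_label if rng.random() < flip_prob else true_label


def active_learn(X, y, budget, batch, strategy, flip_prob, seed=0):
    """Pool-based loop; returns (queried indices, fitted weights)."""
    rng = random.Random(seed)
    pool = list(range(len(X)))
    queried, labels = [], []
    while len(queried) < budget:
        k = min(batch, budget - len(queried))
        if strategy == "uncertainty" and queried:
            w = fit_logreg([X[i] for i in queried], labels)
            # Smallest |p - 0.5| first: the points the model is least sure about.
            pool.sort(key=lambda i: abs(predict_proba(w, X[i]) - 0.5))
        else:
            # Random-sampling baseline (also used to seed the first batch).
            rng.shuffle(pool)
        picked, pool = pool[:k], pool[k:]
        queried.extend(picked)
        labels.extend(noisy_oracle(y[i], flip_prob, rng) for i in picked)
    return queried, fit_logreg([X[i] for i in queried], labels)


def accuracy(w, X, y):
    hits = sum((predict_proba(w, x) > 0.5) == bool(t) for x, t in zip(X, y))
    return hits / len(X)
```

A possible use is to call `active_learn` once with `strategy="random"` and once with `strategy="uncertainty"` at the same `budget` and `flip_prob`, then compare `accuracy` on the full pool. The numbers from this toy setup say nothing about the results reported in the paper.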