Curriculum Adversarial Training

International Joint Conference on Artificial Intelligence (IJCAI)

AI Security Portal bot

The information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1805.04807

PDF

https://arxiv.org/pdf/1805.04807

Bibliographic information

Authors: Qi-Zhi Cai, Min Du, Chang Liu, Dawn Song
Published: 2025-3-25
Affiliation: Nanjing University
Country of affiliation: China
Conference: International Joint Conference on Artificial Intelligence (IJCAI)

Labels estimated by AI

Model robustness, Adversarial training, Data curation

Abstract

Recently, deep learning has been applied to many security-sensitive applications, such as facial authentication. The existence of adversarial examples hinders such applications. The state-of-the-art result on defense shows that adversarial training can be applied to train a robust model on MNIST against adversarial examples, but it fails to achieve a high empirical worst-case accuracy on more complex tasks, such as CIFAR-10 and SVHN. In our work, we propose curriculum adversarial training (CAT) to resolve this issue. The basic idea is to develop a curriculum of adversarial examples generated by attacks with a wide range of strengths. With two techniques to mitigate the forgetting and the generalization issues, we demonstrate that CAT can improve the prior art's empirical worst-case accuracy by a large margin of 25% on CIFAR-10 and 35% on SVHN. At the same time, the model's performance on non-adversarial inputs is comparable to the state-of-the-art models.
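
To make the curriculum idea concrete, here is a minimal sketch in NumPy. It is not the authors' implementation. It assumes a toy logistic-regression model, uses the number of PGD steps `k` as the measure of attack strength, and raises the maximum strength one level at a time. Sampling a strength from all levels seen so far is one simple way to reduce forgetting of weaker attacks; it is an assumption made here for illustration, and the abstract does not specify the paper's two techniques. All function names, the step-size rule, and the hyperparameters are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def pgd_attack(w, b, x, y, eps, k, rng):
    """k-step PGD inside an L-infinity ball of radius eps (k = 0 returns clean x)."""
    if k == 0:
        return x.copy()
    step = 1.25 * eps / k  # hypothetical step-size rule
    x_adv = x + rng.uniform(-eps, eps, size=x.shape)  # random start in the ball
    for _ in range(k):
        p = sigmoid(x_adv @ w + b)
        grad_x = (p - y)[:, None] * w[None, :]  # gradient of logistic loss w.r.t. x
        x_adv = x_adv + step * np.sign(grad_x)  # ascend the loss
        x_adv = np.clip(x_adv, x - eps, x + eps)  # project back into the ball
    return x_adv

def curriculum_adversarial_training(x, y, eps=0.3, max_k=4,
                                    iters_per_level=50, lr=0.1, seed=0):
    """Train on adversarial examples whose strength grows level by level."""
    rng = np.random.default_rng(seed)
    w = np.zeros(x.shape[1])
    b = 0.0
    for level in range(max_k + 1):  # curriculum: weakest attack first
        for _ in range(iters_per_level):
            # Assumption: replay strengths 0..level so weaker attacks are not forgotten.
            k = int(rng.integers(0, level + 1))
            x_adv = pgd_attack(w, b, x, y, eps, k, rng)
            p = sigmoid(x_adv @ w + b)
            w -= lr * (x_adv.T @ (p - y)) / len(y)  # gradient step on adversarial batch
            b -= lr * np.mean(p - y)
    return w, b

def accuracy(w, b, x, y):
    return float(np.mean((sigmoid(x @ w + b) > 0.5) == (y == 1)))
```

As a usage example, one could train on two well-separated Gaussian blobs, then compare `accuracy(w, b, x, y)` on clean inputs with the accuracy on `pgd_attack(w, b, x, y, eps, max_k, rng)` outputs. The latter is an empirical estimate against this one attack only, not a certified worst case.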