Adversarial training, which aims to enhance robustness against adversarial
attacks, has received much attention because human-imperceptible perturbations
of data can easily be generated to deceive a given deep neural network. In this
paper, we propose a new adversarial training algorithm that is theoretically
well motivated and empirically superior to existing algorithms. A novel feature
of the proposed algorithm is that it applies more regularization to data
vulnerable to adversarial attacks than existing regularization algorithms do.
Theoretically, we show that our algorithm can be understood as minimizing a
regularized empirical risk motivated by a newly derived upper bound of the
robust risk. Numerical experiments illustrate that the proposed algorithm
improves generalization (accuracy on clean examples) and robustness (accuracy
under adversarial attacks) simultaneously, achieving state-of-the-art
performance.
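To illustrate the general idea of regularizing vulnerable examples more strongly, the sketch below weights a TRADES-style KL regularization term by a per-example score of how easily each example is fooled. This is a minimal sketch, not the exact algorithm of this paper; the weighting rule, the function name `vulnerability_weighted_loss`, and the hyperparameter values are hypothetical.

```python
# Minimal sketch of vulnerability-aware regularization, assuming a TRADES-style
# KL regularizer; the weighting rule and all names here are illustrative only.
import torch
import torch.nn.functional as F

def vulnerability_weighted_loss(model, x, y, x_adv, beta=6.0):
    logits_clean = model(x)       # predictions on clean inputs
    logits_adv = model(x_adv)     # predictions on adversarially perturbed inputs

    # Standard cross-entropy on the clean examples (generalization term).
    ce = F.cross_entropy(logits_clean, y)

    # Per-example KL divergence between adversarial and clean predictive
    # distributions (robustness regularizer, as in TRADES).
    kl = F.kl_div(F.log_softmax(logits_adv, dim=1),
                  F.softmax(logits_clean, dim=1),
                  reduction="none").sum(dim=1)

    # Hypothetical vulnerability score: examples whose adversarial prediction
    # assigns low probability to the true label receive a larger weight.
    with torch.no_grad():
        true_prob = F.softmax(logits_adv, dim=1).gather(1, y.unsqueeze(1)).squeeze(1)
        weight = 1.0 - true_prob                          # larger = more vulnerable
        weight = weight / weight.mean().clamp_min(1e-8)   # rescale to mean 1

    return ce + beta * (weight * kl).mean()
```

In such a scheme, `x_adv` would typically be produced beforehand by an inner maximization step (e.g., PGD) before the loss is evaluated and backpropagated.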