Improved Adversarial Training via Learned Optimizer


arxiv

AI Security Portal bot

The information in this literature database is collected automatically.

Source

https://arxiv.org/abs/2004.12227

PDF

https://arxiv.org/pdf/2004.12227

Paper information

Authors: Yuanhao Xiong, Cho-Jui Hsieh
Published: 2020-04-26
Affiliation: University of California, Los Angeles
Country: United States of America
Conference: European Conference on Computer Vision (ECCV)

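Labels estimated by AI

Adaptive adversarial training, Poisoning, Optimization problem

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the literature database".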

Abstract

Adversarial attacks have recently become a tremendous threat to deep learning models. To improve the robustness of machine learning models, adversarial training, formulated as a minimax optimization problem, has been recognized as one of the most effective defense mechanisms. However, the non-convex, non-concave nature of the objective poses a great challenge for minimax training. In this paper, we empirically demonstrate that the commonly used PGD attack may not be optimal for the inner maximization, and that an improved inner optimizer can lead to a more robust model. We then leverage a learning-to-learn (L2L) framework to train an optimizer with recurrent neural networks, which adaptively provides update directions and steps for the inner problem. By co-training the optimizer's parameters and the model's weights, the proposed framework consistently improves model robustness over PGD-based adversarial training and TRADES.
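
To make the "inner maximization" concrete: adversarial training is commonly written as min over model weights of the expected value of max over perturbations delta with ||delta||_inf <= eps of loss(model(x + delta), y). The sketch below illustrates only the standard PGD baseline for that inner max, not the paper's learned optimizer. The toy linear logistic model, the function names (logistic_loss, loss_grad_x, pgd_attack), and the values of eps, step, and iters are all assumptions chosen for illustration.

```python
import math

def logistic_loss(w, b, x, y):
    """Logistic loss of a linear model for a label y in {-1, +1}."""
    margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
    return math.log1p(math.exp(-margin))

def loss_grad_x(w, b, x, y):
    """Gradient of the logistic loss with respect to the input x."""
    margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
    coeff = -y / (1.0 + math.exp(margin))
    return [coeff * wi for wi in w]

def sign(v):
    """Return -1, 0, or +1 according to the sign of v."""
    return (v > 0) - (v < 0)

def pgd_attack(w, b, x, y, eps=0.1, step=0.025, iters=10):
    """Approximate the inner max by signed gradient ascent on the loss,
    projecting the perturbation back onto the L-infinity ball of radius eps."""
    delta = [0.0] * len(x)
    for _ in range(iters):
        x_adv = [xi + di for xi, di in zip(x, delta)]
        g = loss_grad_x(w, b, x_adv, y)
        # Fixed hand-designed update: step in the sign of the gradient, then clip.
        delta = [max(-eps, min(eps, di + step * sign(gi)))
                 for di, gi in zip(delta, g)]
    return [xi + di for xi, di in zip(x, delta)]

# Toy example (hypothetical values): a 2-D point with label +1.
w, b = [1.0, -2.0], 0.5
x, y = [0.3, -0.2], 1
x_adv = pgd_attack(w, b, x, y)
clean = logistic_loss(w, b, x, y)
adv = logistic_loss(w, b, x_adv, y)
print(f"clean loss {clean:.4f}, adversarial loss {adv:.4f}")
```

In the L2L framing described in the abstract, the fixed update inside the loop (step times the gradient sign) is the part that a recurrent network would produce adaptively; that component is not implemented here.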