Current neural-network-based classifiers are susceptible to adversarial
examples. The most empirically successful approach to defending against such
adversarial examples is adversarial training, which incorporates a strong
self-attack during training to enhance the model's robustness. This approach,
however, is computationally expensive and hence hard to scale up. A recent
work, called fast adversarial training, has shown that it is possible to
markedly reduce computation time without significantly sacrificing performance. This
approach incorporates simple self-attacks, yet it can be run for only a limited
number of training epochs, resulting in sub-optimal performance. In this paper,
we conduct experiments to understand the behavior of fast adversarial training
and show that the key to its success is its ability to recover from overfitting
to weak attacks. We then extend our findings to improve fast adversarial
training, demonstrating robust accuracy superior to that of strong adversarial
training, with much-reduced training time.
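
For concreteness, the contrast above can be sketched as follows (the notation
is ours, not part of the abstract): adversarial training solves the min-max
problem on the left, where the inner maximization is carried out by a strong
attack such as multi-step PGD, while fast adversarial training approximates it
with the single-step FGSM perturbation on the right.
\[
\min_{\theta}\ \mathbb{E}_{(x,y)\sim\mathcal{D}}
\Big[\max_{\|\delta\|_{\infty}\le\epsilon}\ell\big(f_{\theta}(x+\delta),\,y\big)\Big],
\qquad
\delta_{\mathrm{FGSM}}=\epsilon\cdot\operatorname{sign}\big(\nabla_{x}\,\ell(f_{\theta}(x),y)\big),
\]
where $f_{\theta}$ is the classifier, $\ell$ the training loss, and $\epsilon$
the perturbation budget.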