Understanding Adversarial Robustness: The Trade-off between Minimum and Average Margin


arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1907.11780

PDF

https://arxiv.org/pdf/1907.11780

Bibliographic Information

Authors: Kaiwen Wu, Yaoliang Yu
Published: 2019-07-27
Affiliation
Country of affiliation
Conference

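Labels Estimated by AI

Adversarial examples, Trade-off analysis, Training methods

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".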

Abstract

Deep models, while being extremely versatile and accurate, are vulnerable to adversarial attacks: slight perturbations that are imperceptible to humans can completely flip the prediction of deep models. Many attack and defense mechanisms have been proposed, although a satisfying solution still largely remains elusive. In this work, we give strong evidence that during training, deep models maximize the minimum margin in order to achieve high accuracy, but at the same time decrease the average margin hence hurting robustness. Our empirical results highlight an intrinsic trade-off between accuracy and robustness for current deep model training. To further address this issue, we propose a new regularizer to explicitly promote average margin, and we verify through extensive experiments that it does lead to better robustness. Our regularized objective remains Fisher-consistent, hence asymptotically can still recover the Bayes optimal classifier.
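
The distinction between minimum and average margin in the abstract can be illustrated with a small sketch. It is not the paper's method or regularizer. It assumes a linear classifier and takes "margin" to mean the signed Euclidean distance of a sample to the decision boundary, which may differ from the definition used in the paper. The data, the function names `margins` and `min_and_average_margin`, and the two classifiers are hypothetical and chosen only to show that a larger minimum margin can coincide with a smaller average margin.

```python
import math


def margins(points, labels, w, b):
    """Signed distance of each point to the hyperplane w.x + b = 0.

    Labels are +1 or -1. A positive margin means the point is on the
    correct side of the boundary. (Assumed definition, for illustration.)
    """
    norm = math.sqrt(sum(wi * wi for wi in w))
    return [
        y * (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm
        for x, y in zip(points, labels)
    ]


def min_and_average_margin(points, labels, w, b):
    """Return (minimum margin, average margin) over the data set."""
    m = margins(points, labels, w, b)
    return min(m), sum(m) / len(m)


# Hypothetical 1-D toy data: one negative sample, four positive samples.
points = [(-1.0,), (1.0,), (5.0,), (5.0,), (5.0,)]
labels = [-1, 1, 1, 1, 1]

# Classifier A puts the boundary at x = 0, midway between the closest pair.
min_a, avg_a = min_and_average_margin(points, labels, w=(1.0,), b=0.0)

# Classifier B shifts the boundary to x = -0.5, toward the lone negative sample.
min_b, avg_b = min_and_average_margin(points, labels, w=(1.0,), b=0.5)

print("A: min margin", min_a, "average margin", avg_a)  # min 1.0, average 3.4
print("B: min margin", min_b, "average margin", avg_b)  # min 0.5, average 3.7
```

On this toy data, classifier A has the larger minimum margin while classifier B has the larger average margin, so the two quantities cannot both be maximized by the same boundary.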