It is commonly believed that networks cannot be both accurate and robust,
and that gaining robustness means losing accuracy. It is also generally believed
that, short of making networks larger, architectural elements matter
little for improving adversarial robustness. Here we present
evidence challenging these common beliefs through a careful study of adversarial
training. Our key observation is that the widely-used ReLU activation function
significantly weakens adversarial training due to its non-smooth nature. Hence
we propose smooth adversarial training (SAT), in which we replace ReLU with its
smooth approximations to strengthen adversarial training. Smooth
activation functions allow SAT to find harder adversarial examples
and to compute better gradient updates during adversarial training.
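The core idea, swapping ReLU for a smooth approximation, can be sketched as below. The parametric softplus and the sharpness value alpha here are illustrative assumptions for this sketch, not necessarily the exact form or setting used in the paper; the point is that the smooth surrogate has a well-defined, non-zero gradient everywhere, whereas ReLU's gradient is zero on all negative inputs and undefined at zero.

```python
import math

def relu(x):
    # ReLU: non-smooth at x = 0; gradient is exactly 0 for all x < 0,
    # which degrades the gradients used during adversarial training.
    return max(x, 0.0)

def softplus(x, alpha=5.0):
    # Parametric softplus, one smooth approximation of ReLU
    # (alpha is an assumed sharpness knob; larger alpha -> closer to ReLU).
    return math.log1p(math.exp(alpha * x)) / alpha

def softplus_grad(x, alpha=5.0):
    # The derivative is a sigmoid: smooth and strictly positive everywhere,
    # so gradients never vanish on negative pre-activations.
    return 1.0 / (1.0 + math.exp(-alpha * x))

for x in (-1.0, 0.0, 2.0):
    print(x, relu(x), softplus(x), softplus_grad(x))
```

The sketch shows why a smooth surrogate can both generate stronger adversarial examples (better inner-maximization gradients) and improve the outer weight updates, at essentially the same computational cost as ReLU.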
Compared to standard adversarial training, SAT improves adversarial
robustness for "free", i.e., with no drop in accuracy and no increase in
computational cost. For example, without introducing additional computation,
SAT improves ResNet-50's robustness from 33.0% to 42.3% while
also raising its ImageNet accuracy by 0.9%. SAT also scales well to larger
networks: it helps EfficientNet-L1 achieve 82.2% accuracy and 58.6%
robustness on ImageNet, outperforming the previous state-of-the-art defense by
9.5% in accuracy and 11.6% in robustness. Models are available at
https://github.com/cihangxie/SmoothAdversarialTraining.