AIセキュリティポータル K Program
A Theoretical View of Linear Backpropagation and Its Convergence
Share
Abstract
Backpropagation (BP) is widely used for calculating gradients in deep neural networks (DNNs). Applied often along with stochastic gradient descent (SGD) or its variants, BP is considered as a de-facto choice in a variety of machine learning tasks including DNN training and adversarial attack/defense. Recently, a linear variant of BP named LinBP was introduced for generating more transferable adversarial examples for performing black-box attacks, by Guo et al. Although it has been shown empirically effective in black-box attacks, theoretical studies and convergence analyses of such a method is lacking. This paper serves as a complement and somewhat an extension to Guo et al.'s paper, by providing theoretical analyses on LinBP in neural-network-involved learning tasks, including adversarial attack and model training. We demonstrate that, somewhat surprisingly, LinBP can lead to faster convergence in these tasks in the same hyper-parameter settings, compared to BP. We confirm our theoretical results with extensive experiments.
Backpropagating linearly improves transferability of adversarial examples
Y. Guo, Q. Li, H. Chen
Published: 2020
Very deep convolutional networks for large-scale image recognition
K. Simonyan, A. Zisserman
Published: 2015
Deep residual learning for image recognition
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Published: 2016
Densely connected convolutional networks
G. Huang, Z. Liu, L. Van Der Maaten, K. Q. Weinberger
Published: 2017
Decoupled weight decay regularization
Ilya Loshchilov, Frank Hutter
Published: 2018
Sgdr: Stochastic gradient descent with warm restarts
I. Loshchilov, F. Hutter
Published: 2017
On the importance of initialization and momentum in deep learning
I. Sutskever, J. Martens, G. Dahl, G. Hinton
Published: 2013
Large-scale machine learning with stochastic gradient descent
L. Bottou
Published: 2010
A theoretical framework for back-propagation
Y. LeCun
Published: 1988
Practical black-box attacks against machine learning
Papernot, N., McDaniel, P., Goodfellow, I., Jha, S., Celik, Z. B., Swami, A.
Published: 2017
Mobilenetv2: Inverted residuals and linear bottlenecks
M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, L.-C. Chen
Published: 2018
Explaining and harnessing adversarial examples
Goodfellow, I. J., Shlens, J., Szegedy, C.
Published: 2015
Adversarial Machine Learning at Scale
Alexey Kurakin, Ian Goodfellow, Samy Bengio
Published: 11.4.2016
Towards Deep Learning Models Resistant to Adversarial Attacks
Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu
Published: 6.20.2017
Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples
Anish Athalye, Nicholas Carlini, David Wagner
Published: 2018
Deepfool: a simple and accurate method to fool deep neural networks
S.-M. Moosavi-Dezfooli, A. Fawzi, P. Frossard
Published: 2016
Towards Evaluating the Robustness of Neural Networks
Nicholas Carlini, David Wagner
Published: 8.17.2016
Learning multiple layers of features from tiny images
Alex Krizhevsky, Geoffrey Hinton
Published: 2009
Imagenet large scale visual recognition challenge
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al.
Published: 2015
Gradient descent finds global minima of deep neural networks
S. Du, J. Lee, H. Li, L. Wang, X. Zhai
Published: 2019
Fine-grained analysis of optimization and generalization for overparameterized two-layer neural networks
S. Arora, S. Du, W. Hu, Z. Li, R. Wang
Published: 2019
An analytical formula of population gradient for two-layered relu network and its applications in convergence and critical point analysis
Y. Tian
Published: 2017
Gradient descent provably optimizes over-parameterized neural networks
S. S. Du, X. Zhai, B. Poczos, A. Singh
Published: 2019
Geometry-aware instance-reweighted adversarial training
J. Zhang, J. Zhu, G. Niu, B. Han, M. Sugiyama, M. Kankanhalli
Published: 2021
RobustBench: a standardized adversarial robustness benchmark
F. Croce, M. Andriushchenko, V. Sehwag, E. Debenedetti, N. Flammarion, M. Chiang, P. Mittal, M. Hein
Published: 2021
Gradient-based learning applied to document recognition
Y. LeCun, L. Bottou, Y. Bengio, P. Haffner
Published: 1998
Share