Interest in stochastic zeroth-order (SZO) methods has recently been revived
in black-box optimization scenarios such as adversarial black-box attacks on
deep neural networks. SZO methods only require the ability to evaluate the
objective function at random input points; however, their weakness is the
dependency of their convergence speed on the dimensionality of the function to
be evaluated. We present a sparse SZO optimization method that reduces this
factor to the expected dimensionality of the random perturbation during
learning. We give a proof that justifies this reduction for sparse SZO
optimization of non-convex functions without making any assumptions on the
sparsity of the objective function or its gradient. Furthermore, we present
experimental results for neural networks on MNIST and CIFAR that show faster
convergence in training loss and test accuracy, as well as a smaller distance
between the gradient approximation and the true gradient, for sparse SZO
compared to dense SZO.
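
To make the idea concrete, the following minimal sketch (in Python, using hypothetical names such as sparse_szo_gradient, keep_prob, and mu; not the paper's exact algorithm) illustrates how a sparse random perturbation can be combined with a standard two-point finite-difference estimator, so that the dimensionality factor of the gradient estimate is the expected number of perturbed coordinates rather than the full parameter dimension.

    # Minimal sketch, not the paper's exact algorithm: a two-point SZO gradient
    # estimate along a sparse Gaussian direction, so the estimator scales with
    # the expected number of perturbed coordinates rather than the full dimension.
    import numpy as np

    def sparse_szo_gradient(f, x, mu=1e-3, keep_prob=0.1, rng=None):
        # keep_prob (probability that a coordinate is perturbed) and mu
        # (finite-difference step) are illustrative parameters.
        rng = np.random.default_rng() if rng is None else rng
        mask = rng.random(x.shape) < keep_prob      # random sparse support
        u = rng.standard_normal(x.shape) * mask     # sparse perturbation direction
        if not mask.any():                          # avoid a zero perturbation
            return np.zeros_like(x)
        return (f(x + mu * u) - f(x)) / mu * u      # forward-difference estimate

    # Toy usage: SZO "gradient" descent on a quadratic in 100 dimensions.
    d = 100
    target = np.ones(d)
    f = lambda x: float(np.sum((x - target) ** 2))
    x = np.zeros(d)
    for _ in range(5000):
        x -= 0.05 * sparse_szo_gradient(f, x)
    print("final loss:", f(x))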