Learning to Learn by Zeroth-Order Oracle

TOP 文献データベース Learning to Learn by Zeroth-Order Oracle

International Conference on Learning Representations (ICLR)

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1910.09464

PDF

https://arxiv.org/pdf/1910.09464

文献情報

作者: Yangjun Ruan,Yuanhao Xiong,Sashank Reddi,Sanjiv Kumar,Cho-Jui Hsieh
公開日: 2025-3-25
所属機関: Department of Infomation Science and Electrical Engineering, Zhejiang University
所属の国: China
会議名: International Conference on Learning Representations (ICLR)

AIにより推定されたラベル

最適化アルゴリズムの選択と評価性能評価損失項

Abstract

In the learning to learn (L2L) framework, we cast the design of optimization algorithms as a machine learning problem and use deep neural networks to learn the update rules. In this paper, we extend the L2L framework to zeroth-order (ZO) optimization setting, where no explicit gradient information is available. Our learned optimizer, modeled as recurrent neural network (RNN), first approximates gradient by ZO gradient estimator and then produces parameter update utilizing the knowledge of previous iterations. To reduce high variance effect due to ZO gradient estimator, we further introduce another RNN to learn the Gaussian sampling rule and dynamically guide the query direction sampling. Our learned optimizer outperforms hand-designed algorithms in terms of convergence rate and final solution on both synthetic and practical ZO optimization tasks (in particular, the black-box adversarial attack task, which is one of the most widely used tasks of ZO optimization). We finally conduct extensive analytical experiments to demonstrate the effectiveness of our proposed optimizer.