Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization

TOP 文献データベース Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization

Conference on Neural Information Processing Systems (NeurIPS)

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2206.02953

PDF

https://arxiv.org/pdf/2206.02953

文献情報

作者: Aniket Das;Bernhard Schölkopf;Michael Muehlebach
公開日: 2022-6-7
更新日: 2022-10-10
所属機関: Indian Institute of Technology Kanpur
所属の国: India
会議名: Conference on Neural Information Processing Systems (NeurIPS)

AIにより推定されたラベル

収束性分析形式的検証関数の定義

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

We analyze the convergence rates of stochastic gradient algorithms for smooth finite-sum minimax optimization and show that, for many such algorithms, sampling the data points without replacement leads to faster convergence compared to sampling with replacement. For the smooth and strongly convex-strongly concave setting, we consider gradient descent ascent and the proximal point method, and present a unified analysis of two popular without-replacement sampling strategies, namely Random Reshuffling (RR), which shuffles the data every epoch, and Single Shuffling or Shuffle Once (SO), which shuffles only at the beginning. We obtain tight convergence rates for RR and SO and demonstrate that these strategies lead to faster convergence than uniform sampling. Moving beyond convexity, we obtain similar results for smooth nonconvex-nonconcave objectives satisfying a two-sided Polyak-{\L}ojasiewicz inequality. Finally, we demonstrate that our techniques are general enough to analyze the effect of data-ordering attacks, where an adversary manipulates the order in which data points are supplied to the optimizer. Our analysis also recovers tight rates for the incremental gradient method, where the data points are not shuffled at all.