Byzantine Failures Harm the Generalization of Robust Distributed Learning Algorithms More Than Data Poisoning

Abstract

Robust distributed learning algorithms aim to maintain reliable performance despite the presence of misbehaving workers. Such misbehaviors are commonly modeled as Byzantine failures, allowing arbitrarily corrupted communication, or as data poisoning, a weaker form of corruption restricted to local training data. While prior work shows similar optimization guarantees for both models, an important question remains: How do these threat models impact generalization? Empirical evidence suggests a gap, yet it remains unclear whether it is unavoidable or merely an artifact of suboptimal attacks. We show, for the first time, a fundamental gap in generalization guarantees between the two threat models: Byzantine failures yield strictly worse rates than those achievable under data poisoning. Our findings leverage a tight algorithmic stability analysis of robust distributed learning. Specifically, we prove that: (i) under data poisoning, the uniform algorithmic stability of an algorithm with optimal optimization guarantees degrades by an additive factor of $\Theta\big(\frac{f}{n-f}\big)$, with $f$ out of $n$ workers misbehaving; whereas (ii) under Byzantine failures, the degradation is in $\Omega\big(\sqrt{\frac{f}{n-2f}}\big)$.
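To make the size of this gap concrete, the sketch below numerically compares the two degradation terms quoted in the abstract. It is an illustrative calculation only, not code from the paper: it ignores constant factors, fixes an assumed total of n = 100 workers, and simply evaluates f/(n-f) against sqrt(f/(n-2f)) for a few corruption levels f.

# Illustrative comparison of the two stability-degradation rates from the abstract.
# Hypothetical sketch (assumed n = 100, constants ignored), not the paper's code.
import math

n = 100  # total number of workers (assumed for illustration)

print(f"{'f':>4} {'poisoning f/(n-f)':>20} {'Byzantine sqrt(f/(n-2f))':>26}")
for f in (1, 5, 10, 20, 30):
    poisoning_rate = f / (n - f)                  # additive degradation under data poisoning
    byzantine_rate = math.sqrt(f / (n - 2 * f))   # lower-bound term under Byzantine failures
    print(f"{f:>4} {poisoning_rate:>20.4f} {byzantine_rate:>26.4f}")

For any small corruption fraction (f/(n-f) < 1), the square-root term dominates the linear one, which is precisely the generalization gap between the Byzantine and data-poisoning threat models that the abstract highlights.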
