A High Dimensional Statistical Model for Adversarial Training: Geometry and Trade-Offs

TOP 文献データベース A High Dimensional Statistical Model for Adversarial Training: Geometry and Trade-Offs

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2402.05674

PDF

https://arxiv.org/pdf/2402.05674

文献情報

作者: Kasimir Tanner;Matteo Vilucchio;Bruno Loureiro;Florent Krzakala
公開日: 2024-2-8
更新日: 2024-12-28
所属機関: Information Learning and Physics Laboratory, École Polytechnique Fédérale de Lausanne (EPFL)
所属の国: Switzerland
会議名: International Conference on Artificial Intelligence and Statistics (AISTATS)

AIにより推定されたラベル

損失関数収束特性ウォーターマーキング

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

This work investigates adversarial training in the context of margin-based linear classifiers in the high-dimensional regime where the dimension $d$ and the number of data points $n$ diverge with a fixed ratio $\alpha = n / d$. We introduce a tractable mathematical model where the interplay between the data and adversarial attacker geometries can be studied, while capturing the core phenomenology observed in the adversarial robustness literature. Our main theoretical contribution is an exact asymptotic description of the sufficient statistics for the adversarial empirical risk minimiser, under generic convex and non-increasing losses for a Block Feature Model. Our result allow us to precisely characterise which directions in the data are associated with a higher generalisation/robustness trade-off, as defined by a robustness and a usefulness metric. We show that the the presence of multiple different feature types is crucial to the high sample complexity performances of adversarial training. In particular, we unveil the existence of directions which can be defended without penalising accuracy. Finally, we show the advantage of defending non-robust features during training, identifying a uniform protection as an inherently effective defence mechanism.

外部データセット

CIFAR10

FashionMNIST