Regularization for Adversarial Robust Learning | AI Security Portal



arxiv

Regularization for Adversarial Robust Learning

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2408.09672

PDF

https://arxiv.org/pdf/2408.09672

Paper information

Authors: Jie Wang; Rui Gao; Yao Xie
Published: 2024-8-19
Updated: 2024-8-22
Affiliation: School of Industrial and Systems Engineering, Georgia Institute of Technology
Country of affiliation: United States of America
Venue: Computing Research Repository (CoRR)

Labels estimated by AI

Poisoning, Algorithm, Regularization

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the literature database".

Abstract

Despite the growing prevalence of artificial neural networks in real-world applications, their vulnerability to adversarial attacks remains a significant concern, which motivates us to investigate the robustness of machine learning models. While various heuristics aim to optimize the distributionally robust risk using the $\infty$-Wasserstein metric, such a notion of robustness frequently encounters computational intractability. To tackle the computational challenge, we develop a novel approach to adversarial training that integrates $\phi$-divergence regularization into the distributionally robust risk function. This regularization brings a notable improvement in computation compared with the original formulation. We develop stochastic gradient methods with biased oracles to solve this problem efficiently, achieving the near-optimal sample complexity. Moreover, we establish its regularization effects and demonstrate its asymptotic equivalence to a regularized empirical risk minimization framework, by considering various scaling regimes of the regularization parameter and robustness level. These regimes yield gradient norm regularization, variance regularization, or a smoothed gradient norm regularization that interpolates between these extremes. We numerically validate our proposed method in supervised learning, reinforcement learning, and contextual learning and showcase its state-of-the-art performance against various adversarial attacks.
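To give a rough feel for why a $\phi$-divergence penalty can ease computation, the sketch below works through one special case. It assumes the KL divergence as the choice of $\phi$ and a uniform reference distribution on an $\ell_\infty$ ball around the input; the abstract fixes neither, so both are assumptions of this example rather than the authors' stated setup. Under those assumptions, the standard Gibbs variational formula turns the inner worst-case expectation into a log-mean-exp of the loss, $\eta \log \mathbb{E}[\exp(\ell/\eta)]$, which can be estimated by sampling instead of solving a hard maximization. The function names, the linear model, and all numeric values are hypothetical.

```python
import numpy as np

def kl_regularized_loss(losses, eta):
    """Smoothed worst-case loss eta * log(mean(exp(losses / eta))).

    Computed with a max-shift for numerical stability. Small eta
    approaches max(losses); large eta approaches mean(losses).
    """
    losses = np.asarray(losses, dtype=float)
    scaled = losses / eta
    shift = scaled.max()
    return eta * (shift + np.log(np.mean(np.exp(scaled - shift))))

def sampled_losses(w, x, y, radius, n_samples, rng):
    """Squared losses of a linear model at points drawn uniformly
    from the l_inf ball of the given radius around x (an assumed
    reference distribution for this sketch)."""
    noise = rng.uniform(-radius, radius, size=(n_samples, x.shape[0]))
    preds = (x + noise) @ w
    return (preds - y) ** 2

rng = np.random.default_rng(0)
w = np.array([1.0, -2.0, 0.5])   # hypothetical model weights
x = np.array([0.3, 0.1, -0.4])   # hypothetical input
y = 0.0                          # hypothetical target

losses = sampled_losses(w, x, y, radius=0.1, n_samples=2000, rng=rng)
soft = kl_regularized_loss(losses, eta=0.05)
near_max = kl_regularized_loss(losses, eta=1e-4)
near_mean = kl_regularized_loss(losses, eta=1e4)
```

In this toy setting the regularization parameter `eta` interpolates between the average loss over the ball (large `eta`) and the sampled worst case (small `eta`), which loosely mirrors the interpolation between regimes mentioned in the abstract.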

External datasets

MNIST

Fashion-MNIST

Kuzushiji-MNIST

References

On the convergence of SGD with biased gradients
Authors: Ajalloeian, A., Stich, S. U.
Published: 2020

Improved generalization bounds for robust learning
Authors: Idan Attias, Aryeh Kontorovich, Yishay Mansour
Published: 2019
Venue: Algorithmic Learning Theory

Adversarial Learning Guarantees for Linear Hypotheses and Neural Networks
Authors: Pranjal Awasthi, Natalie Frank, Mehryar Mohri
Published: 2020.4.29
Venue: International Conference on Machine Learning (ICML)
Abstract: Adversarial or test time robustness measures the susceptibility of a classifier to perturbations to the test input. While there has been a flurry of recent work on designing defenses against such perturbations, the theory of adversarial robustness is not well understood. In order to make progress on this, we focus on the problem of understanding generalization in adversarial settings, via the lens of Rademacher complexity. We give upper and lower bounds for the adversarial empirical Rademacher complexity of linear hypotheses with adversarial perturbations measured in $l_r$-norm for an arbitrary $r \geq 1$. This generalizes the recent result of [Yin et al.'19] that studies the case of $r = \infty$, and provides a finer analysis of the dependence on the input dimensionality as compared to the recent work of [Khim and Loh'19] on linear hypothesis classes. We then extend our analysis to provide Rademacher complexity lower and upper bounds for a single ReLU unit. Finally, we give adversarial Rademacher complexity bounds for feed-forward neural networks with one hidden layer. Unlike previous works we directly provide bounds on the adversarial Rademacher complexity of the given network, as opposed to a bound on a surrogate. A by-product of our analysis also leads to tighter bounds for the Rademacher complexity of linear hypotheses, for which we give a detailed analysis and present a comparison with existing bounds.
Labels: Robustness improvement methods, Formal verification, Adversarial attack detection

Regularization for Wasserstein distributionally robust optimization
Authors: Azizian W, Iutzeler F, Malick J
Published: 2023
Venue: ESAIM: Control, Optimisation and Calculus of Variations

Spectrally-normalized margin bounds for neural networks
Authors: Bartlett, P.L., Foster, D.J., Telgarsky, M.J.
Published: 2017
Venue: Advances in Neural Information Processing Systems

Data-driven stochastic programming using phi-divergences
Authors: Bayraksan G, Love DK
Published: 2015
Venue: The Operations Research Revolution

Robust solutions of optimization problems affected by uncertain probabilities
Authors: Ben-Tal A, den Hertog D, De Waegenaere A, Melenberg B, Rennen G
Published: 2013
Venue: Management Science

Penalty functions and duality in stochastic programming via φ-divergence functionals
Authors: Ben-Tal A, Teboulle M
Published: 1987
Venue: Mathematics of Operations Research

From predictive to prescriptive analytics
Authors: Bertsimas D, Kallus N
Published: 2020
Venue: Management Science

Persistence in discrete optimization under data uncertainty
Authors: Bertsimas D, Natarajan K, Teo CP
Published: 2006
Venue: Mathematical Programming