Adversarial Surrogate Risk Bounds for Binary Classification

Source

https://arxiv.org/abs/2506.09348

PDF

https://arxiv.org/pdf/2506.09348

Paper Information

Author: Natalie S. Frank
Published: 2025-06-11
Affiliation: Department of Applied Mathematics, University of Washington
Country: United States of America

Labels Estimated by AI

Certified Robustness Convergence Analysis Function Boundary Pair Formation

These labels were automatically added by AI and may be inaccurate.

Abstract

A central concern in classification is the vulnerability of machine learning models to adversarial attacks. Adversarial training, one of the most popular techniques for training robust classifiers, involves minimizing an adversarial surrogate risk. Recent work characterized when a minimizing sequence of an adversarial surrogate risk is also a minimizing sequence of the adversarial classification risk for binary classification, a property known as adversarial consistency. However, these results do not address the rate at which the adversarial classification risk converges to its optimal value along a sequence of functions that minimizes the adversarial surrogate risk. This paper provides surrogate risk bounds that quantify that convergence rate. Additionally, we derive distribution-dependent surrogate risk bounds in the standard (non-adversarial) learning setting, which may be of independent interest.
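
To make the two quantities in the abstract concrete, the following is a minimal sketch that computes an empirical adversarial classification risk and an empirical adversarial surrogate risk on a toy dataset. It is not taken from the paper. It assumes a linear classifier f(x) = w.x + b, an l2 perturbation ball of radius eps, the hinge loss as the surrogate, and the convention that a point is misclassified when its margin y f(x') is at most 0. All function names and the data are hypothetical. For a linear f, the worst-case margin over the ball has the closed form y f(x) - eps ||w||, so the supremum needs no search.

```python
import numpy as np

def adversarial_margins(w, b, X, y, eps):
    """Worst-case margins y * f(x') over ||x' - x||_2 <= eps for linear f(x) = w.x + b."""
    # For a linear classifier the adversary shifts the margin down by exactly eps * ||w||_2.
    return y * (X @ w + b) - eps * np.linalg.norm(w)

def adversarial_classification_risk(w, b, X, y, eps):
    # Assumed convention: an error occurs if some allowed perturbation makes the margin <= 0.
    return float(np.mean(adversarial_margins(w, b, X, y, eps) <= 0))

def adversarial_hinge_risk(w, b, X, y, eps):
    # The hinge loss is non-increasing in the margin, so its supremum over the
    # ball is attained at the worst-case margin.
    return float(np.mean(np.maximum(0.0, 1.0 - adversarial_margins(w, b, X, y, eps))))

# Toy one-dimensional data (hypothetical), labels in {-1, +1}.
X = np.array([[2.0], [0.5], [-0.5], [-2.0]])
y = np.array([1.0, 1.0, -1.0, -1.0])
w, b = np.array([1.0]), 0.0

for eps in (0.0, 0.5):
    print(eps,
          adversarial_classification_risk(w, b, X, y, eps),
          adversarial_hinge_risk(w, b, X, y, eps))
```

Because the hinge loss upper-bounds the 0-1 loss under this convention, the adversarial hinge risk is never smaller than the adversarial classification risk. A surrogate risk bound of the kind the abstract describes goes further and controls the excess adversarial classification risk in terms of the excess adversarial surrogate risk; this sketch only evaluates the two risks and does not reproduce any bound from the paper.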