Outlier Robust Adversarial Training

TOP Literature Database Outlier Robust Adversarial Training

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2309.05145

PDF

https://arxiv.org/pdf/2309.05145

Paper Information

Author: Shu Hu;Zhenhuan Yang;Xin Wang;Yiming Ying;Siwei Lyu
Published: 9-11-2023
Affiliation: Department of Computer and Information Technology, Purdue School of Engineering and Technology, Indiana University–Purdue University Indianapolis
Country: United States of America
Conference

Labels Estimated by AI

Adversarial attack Loss Term Convergence Property

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Supervised learning models are challenged by the intrinsic complexities of training data such as outliers and minority subpopulations and intentional attacks at inference time with adversarial samples. While traditional robust learning methods and the recent adversarial training approaches are designed to handle each of the two challenges, to date, no work has been done to develop models that are robust with regard to the low-quality training data and the potential adversarial attack at inference time simultaneously. It is for this reason that we introduce Outlier Robust Adversarial Training (ORAT) in this work. ORAT is based on a bi-level optimization formulation of adversarial training with a robust rank-based loss function. Theoretically, we show that the learning objective of ORAT satisfies the $\mathcal{H}$-consistency in binary classification, which establishes it as a proper surrogate to adversarial 0/1 loss. Furthermore, we analyze its generalization ability and provide uniform convergence rates in high probability. ORAT can be optimized with a simple algorithm. Experimental evaluations on three benchmark datasets demonstrate the effectiveness and robustness of ORAT in handling outliers and adversarial attacks. Our code is available at https://github.com/discovershu/ORAT.

External Datasets

MNIST

CIFAR-10

CIFAR-100