Expressive Losses for Verified Robustness via Convex Combinations

TOP Literature Database Expressive Losses for Verified Robustness via Convex Combinations

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2305.13991

PDF

https://arxiv.org/pdf/2305.13991

Paper Information

Author: Alessandro De Palma;Rudy Bunel;Krishnamurthy Dvijotham;M. Pawan Kumar;Robert Stanforth;Alessio Lomuscio
Published: 5-23-2023
Updated: 3-18-2024
Affiliation: Inria, École Normale Supérieure, PSL University, CNRS
Country: France
Conference: International Conference on Learning Representations (ICLR)

Labels Estimated by AI

Machine Learning Method Deep Learning Method Parameter Tuning

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

In order to train networks for verified adversarial robustness, it is common to over-approximate the worst-case loss over perturbation regions, resulting in networks that attain verifiability at the expense of standard performance. As shown in recent work, better trade-offs between accuracy and robustness can be obtained by carefully coupling adversarial training with over-approximations. We hypothesize that the expressivity of a loss function, which we formalize as the ability to span a range of trade-offs between lower and upper bounds to the worst-case loss through a single parameter (the over-approximation coefficient), is key to attaining state-of-the-art performance. To support our hypothesis, we show that trivial expressive losses, obtained via convex combinations between adversarial attacks and IBP bounds, yield state-of-the-art results across a variety of settings in spite of their conceptual simplicity. We provide a detailed analysis of the relationship between the over-approximation coefficient and performance profiles across different expressive losses, showing that, while expressivity is essential, better approximations of the worst-case loss are not necessarily linked to superior robustness-accuracy trade-offs.

External Datasets

MNIST

CIFAR-10

TinyImageNet

downscaled ImageNet