Expressive Losses for Verified Robustness via Convex Combinations

TOP 文献データベース Expressive Losses for Verified Robustness via Convex Combinations

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2305.13991

PDF

https://arxiv.org/pdf/2305.13991

文献情報

作者: Alessandro De Palma;Rudy Bunel;Krishnamurthy Dvijotham;M. Pawan Kumar;Robert Stanforth;Alessio Lomuscio
公開日: 2023-5-23
更新日: 2024-3-18
所属機関: Inria, École Normale Supérieure, PSL University, CNRS
所属の国: France
会議名: International Conference on Learning Representations (ICLR)

AIにより推定されたラベル

機械学習手法深層学習手法パラメータ調整

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

In order to train networks for verified adversarial robustness, it is common to over-approximate the worst-case loss over perturbation regions, resulting in networks that attain verifiability at the expense of standard performance. As shown in recent work, better trade-offs between accuracy and robustness can be obtained by carefully coupling adversarial training with over-approximations. We hypothesize that the expressivity of a loss function, which we formalize as the ability to span a range of trade-offs between lower and upper bounds to the worst-case loss through a single parameter (the over-approximation coefficient), is key to attaining state-of-the-art performance. To support our hypothesis, we show that trivial expressive losses, obtained via convex combinations between adversarial attacks and IBP bounds, yield state-of-the-art results across a variety of settings in spite of their conceptual simplicity. We provide a detailed analysis of the relationship between the over-approximation coefficient and performance profiles across different expressive losses, showing that, while expressivity is essential, better approximations of the worst-case loss are not necessarily linked to superior robustness-accuracy trade-offs.

外部データセット

MNIST

CIFAR-10

TinyImageNet

downscaled ImageNet