MultiRobustBench: Benchmarking Robustness Against Multiple Attacks

TOP 文献データベース MultiRobustBench: Benchmarking Robustness Against Multiple Attacks

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2302.10980

PDF

https://arxiv.org/pdf/2302.10980

文献情報

作者: Sihui Dai;Saeed Mahloujifar;Chong Xiang;Vikash Sehwag;Pin-Yu Chen;Prateek Mittal
公開日: 2023-2-22
更新日: 2023-7-20
所属機関: Electrical and Computer Engineering, Princeton University
所属の国: United States of America
会議名: International Conference on Machine Learning (ICML)

AIにより推定されたラベル

モデル性能評価ポイズニング DNN IP保護手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

The bulk of existing research in defending against adversarial examples focuses on defending against a single (typically bounded Lp-norm) attack, but for a practical setting, machine learning (ML) models should be robust to a wide variety of attacks. In this paper, we present the first unified framework for considering multiple attacks against ML models. Our framework is able to model different levels of learner's knowledge about the test-time adversary, allowing us to model robustness against unforeseen attacks and robustness against unions of attacks. Using our framework, we present the first leaderboard, MultiRobustBench, for benchmarking multiattack evaluation which captures performance across attack types and attack strengths. We evaluate the performance of 16 defended models for robustness against a set of 9 different attack types, including Lp-based threat models, spatial transformations, and color changes, at 20 different attack strengths (180 attacks total). Additionally, we analyze the state of current defenses against multiple attacks. Our analysis shows that while existing defenses have made progress in terms of average robustness across the set of attacks used, robustness against the worst-case attack is still a big open problem as all existing models perform worse than random guessing.

外部データセット

CIFAR-10