Machine learning models -- deep neural networks in particular -- have
performed remarkably well on benchmark datasets across a wide variety of
domains. However, adversarial counter-examples remain a persistent problem: training
times are measured in hours or days, while a successful adversarial counter-example
can often be found in seconds. Much work has gone into generating and defending
against these adversarial counter-examples; however, the relative costs of attacks
and defences are rarely discussed. Additionally, machine learning research is guided
almost entirely by test/train metrics, yet verifying the failure rates demanded by
industry standards would require billions of samples. The present work addresses the problem
of understanding and predicting how particular model hyper-parameters influence
the performance of a model in the presence of an adversary. The proposed
approach uses survival models, worst-case examples, and a cost-aware analysis
to reject a particular model change precisely and accurately during routine
model training, rather than relying on real-world deployment, expensive formal
verification methods, or accurate simulations of very complicated systems
(\textit{e.g.}, digitally recreating every part of a car or a plane).
Evaluating many pre-processing techniques, adversarial counter-examples, and
neural network configurations, we conclude that deeper models do offer marginal
gains in survival time over their shallower counterparts. However, we show that
those gains are driven more by increased model inference time than by inherent
robustness properties. Using the proposed methodology, we show that ResNet is
hopelessly insecure against even the simplest of white-box attacks.
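
As a minimal sketch of the survival-analysis step, assuming per-configuration
attack timings and the \texttt{lifelines} library (the column names, toy data,
and Weibull accelerated-failure-time model below are illustrative assumptions
rather than the exact pipeline used in this work):

\begin{verbatim}
# Minimal sketch: relate hyper-parameters to adversarial "survival time"
# with an accelerated-failure-time model. The column names, toy data, and
# the choice of lifelines' WeibullAFTFitter are illustrative assumptions.
import pandas as pd
from lifelines import WeibullAFTFitter

# One row per (model configuration, attack run):
#   depth          -- number of layers in the evaluated network
#   inference_time -- seconds per forward pass
#   attack_time    -- seconds until a successful adversarial counter-example
#   failed         -- 1 if the attack succeeded within budget, 0 if censored
df = pd.DataFrame({
    "depth":          [18, 18, 34, 34, 50, 50, 101, 101],
    "inference_time": [0.8, 0.9, 1.4, 1.5, 2.1, 2.2, 3.9, 4.0],
    "attack_time":    [3.1, 2.7, 4.0, 4.4, 5.2, 5.0, 7.9, 8.3],
    "failed":         [1, 1, 1, 1, 1, 1, 1, 0],
})

# Fit survival (attack) time as a function of depth.
aft = WeibullAFTFitter()
aft.fit(df[["depth", "attack_time", "failed"]],
        duration_col="attack_time", event_col="failed")
aft.print_summary()

# Cost-aware view: survival time gained per unit of inference cost, so a
# model change is rejected if it buys robustness only by being slower.
median_survival = aft.predict_median(df[["depth"]]).squeeze()
cost_adjusted = median_survival / df["inference_time"]
print(cost_adjusted.groupby(df["depth"]).mean())
\end{verbatim}

An accelerated-failure-time parameterisation is convenient here because its
coefficients act multiplicatively on expected survival time, which pairs
naturally with the cost-aware comparison against inference time.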