Revisiting Ensembles in an Adversarial Context: Improving Natural Accuracy

TOP 文献データベース Revisiting Ensembles in an Adversarial Context: Improving Natural Accuracy

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2002.11572

PDF

https://arxiv.org/pdf/2002.11572

文献情報

作者: Aditya Saligrama,Guillaume Leclerc
公開日: 2020-2-27
所属機関: MIT PRIMES
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

ロバスト性評価敵対的訓練性能評価

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

A necessary characteristic for the deployment of deep learning models in real world applications is resistance to small adversarial perturbations while maintaining accuracy on non-malicious inputs. While robust training provides models that exhibit better adversarial accuracy than standard models, there is still a significant gap in natural accuracy between robust and non-robust models which we aim to bridge. We consider a number of ensemble methods designed to mitigate this performance difference. Our key insight is that model trained to withstand small attacks, when ensembled, can often withstand significantly larger attacks, and this concept can in turn be leveraged to optimize natural accuracy. We consider two schemes, one that combines predictions from several randomly initialized robust models, and the other that fuses features from robust and standard models.