REGroup: Rank-aggregating Ensemble of Generative Classifiers for Robust Predictions

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2006.10679

PDF

https://arxiv.org/pdf/2006.10679

Bibliographic Information

Authors: Lokender Tiwari; Anish Madan; Saket Anand; Subhashis Banerjee
Published: 2020-06-19
Updated: 2021-11-24
Affiliation: IIIT-Delhi
Country of affiliation: India
Conference: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

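Labels Estimated by AI

Adversarial examples
Adversarial learning
Poisoning

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".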

Abstract

Deep Neural Networks (DNNs) are often criticized for being susceptible to adversarial attacks. Most successful defense strategies adopt adversarial training or random input transformations that typically require retraining or fine-tuning the model to achieve reasonable performance. In this work, our investigations of intermediate representations of a pre-trained DNN lead to an interesting discovery pointing to intrinsic robustness to adversarial attacks. We find that we can learn a generative classifier by statistically characterizing the neural response of an intermediate layer to clean training samples. The predictions of multiple such intermediate-layer based classifiers, when aggregated, show unexpected robustness to adversarial attacks. Specifically, we devise an ensemble of these generative classifiers that rank-aggregates their predictions via a Borda count-based consensus. Our proposed approach uses a subset of the clean training data and a pre-trained model, and yet is agnostic to network architectures or the adversarial attack generation method. We show extensive experiments to establish that our defense strategy achieves state-of-the-art performance on the ImageNet validation set.
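
The Borda count consensus mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `borda_aggregate`, the score matrix, and the choice to rank by a generic "higher is better" score are all assumptions made for illustration. Each row stands for one hypothetical intermediate-layer classifier, each column for one class; every classifier awards points to classes by rank position, and the class with the most total points is the ensemble prediction.

```python
import numpy as np


def borda_aggregate(scores):
    """Aggregate per-classifier class scores with a Borda count.

    scores: array-like of shape (n_classifiers, n_classes), where a higher
    score means the classifier prefers that class more (an assumption of
    this sketch). Returns (predicted_class, total_points_per_class).
    """
    scores = np.asarray(scores, dtype=float)
    # Applying argsort twice turns scores into rank positions per row:
    # 0 for the lowest-scored class, n_classes - 1 for the highest.
    # Ties are broken by class index order, which is a simplification.
    ranks = scores.argsort(axis=1).argsort(axis=1)
    # Borda points: sum each class's rank position over all classifiers.
    points = ranks.sum(axis=0)
    return int(points.argmax()), points


# Hypothetical scores from three layer-wise classifiers over three classes.
layer_scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.5, 0.3],
]
predicted, points = borda_aggregate(layer_scores)
print(predicted, points.tolist())
```

In this toy input the second classifier prefers class 0, but classes are scored by rank rather than by raw confidence, so the consensus still lands on class 1, which the other two classifiers rank first.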

External Datasets

ImageNet

V50K

V10K

V2K

V10C

CIFAR-10