We present an algorithm for computing class-specific universal adversarial
perturbations for deep neural networks. Such perturbations can induce
misclassification in a large fraction of the images belonging to a chosen
class. Unlike previous methods, which compute a universal perturbation through
iterative optimization, the proposed method constructs the perturbation as a
linear function of the weights of the neural network, so it can be computed
much faster. The method requires no training data and has no hyper-parameters.
The attack achieves fooling rates of 34% to 51% on state-of-the-art deep
neural networks on ImageNet, and the perturbations transfer across models.
We also study the characteristics of the decision boundaries learned by
standard and adversarially trained models to better understand universal
adversarial perturbations.
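
The abstract leaves the construction of the weight-linear perturbation
unspecified. As a hedged illustration only (not the paper's algorithm), the
sketch below shows why such a perturbation can be data-free and free of tuned
hyper-parameters in the simplest case of a linear classifier f(x) = Wx + b:
the weight row of the target class directly gives the direction that lowers
that class's logit for every input. The function name class_specific_uap, the
norm budget eps, and the linear-model setting are illustrative assumptions,
not details from the paper.

```python
import torch

def class_specific_uap(W: torch.Tensor, target_class: int, eps: float) -> torch.Tensor:
    """Hypothetical sketch: for a linear classifier f(x) = W @ x + b, the
    perturbation -eps * W[c] / ||W[c]|| lowers the logit of class c by
    eps * ||W[c]|| for every input x -- a perturbation that is a linear
    function of the weights, computed with no data and no optimization.
    (eps is the attack's norm budget, fixed by the threat model rather
    than tuned.)"""
    w_c = W[target_class]
    return -eps * w_c / w_c.norm()

# Toy usage: a random linear classifier over flattened 8x8 inputs.
torch.manual_seed(0)
W = torch.randn(10, 64)
x = torch.randn(64)
v = class_specific_uap(W, target_class=3, eps=5.0)
print((W @ x)[3])        # clean logit of class 3
print((W @ (x + v))[3])  # same logit reduced by exactly eps * W[3].norm()
```

For a deep network the logits are no longer linear in the input, so this
closed form is only a caricature; it is meant to convey how a weight-derived
direction can act as a class-specific universal perturbation without
iterative optimization.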