The wide adoption of artificial neural networks across domains has led to
increasing interest in defending them against adversarial attacks.
Preprocessing defense methods such as pixel discretization are particularly
attractive in practice due to their simplicity, low computational overhead, and
applicability to various systems. Such methods have been observed to work well
on simple datasets like MNIST but to break down on more complicated ones like
ImageNet under recently proposed strong white-box attacks. To understand the
conditions for success and the potential for improvement, we study the pixel discretization
defense method, including more sophisticated variants that take into account
the properties of the dataset being discretized. Our results again show poor
resistance to these strong attacks. We analyze our findings in a theoretical
framework and offer strong evidence that pixel discretization is unlikely to
work on all but the simplest of datasets. Furthermore, our arguments offer
insight into why some other preprocessing defenses may be insecure.
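To make the defense concrete: pixel discretization preprocesses an input by snapping each pixel value to the nearest element of a small codebook before classification. A minimal sketch (function and variable names are illustrative, not from the paper; the simplest variant uses a fixed two-point codebook, i.e., binarization) might look like:

```python
import numpy as np

def discretize(image, codewords):
    """Snap every pixel to its nearest codeword (basic pixel discretization)."""
    image = np.asarray(image, dtype=np.float32)
    codewords = np.asarray(codewords, dtype=np.float32)
    # For each pixel, index of the closest codeword.
    idx = np.abs(image[..., None] - codewords).argmin(axis=-1)
    return codewords[idx]

# Example: binarizing pixel intensities in [0, 1] with codebook {0, 1},
# as is effective on MNIST-like data.
img = np.array([[0.10, 0.80],
                [0.45, 0.95]])
print(discretize(img, [0.0, 1.0]))  # [[0. 1.] [0. 1.]]
```

More sophisticated variants, such as those studied in the paper, would choose the codebook based on properties of the dataset rather than fixing it in advance.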