Black-Box Adversarial Attack with Transferable Model-based Embedding

TOP 文献データベース Black-Box Adversarial Attack with Transferable Model-based Embedding

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1911.07140

PDF

https://arxiv.org/pdf/1911.07140

文献情報

作者: Zhichao Huang,Tong Zhang
公開日: 2019-11-17
更新日: 2020-1-5
所属機関: The Hong Kong University of Science and Technology
所属の国: Hong Kong
会議名

AIにより推定されたラベル

敵対的攻撃手法敵対的サンプル知識移転性

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

We present a new method for black-box adversarial attack. Unlike previous methods that combined transfer-based and scored-based methods by using the gradient or initialization of a surrogate white-box model, this new method tries to learn a low-dimensional embedding using a pretrained model, and then performs efficient search within the embedding space to attack an unknown target network. The method produces adversarial perturbations with high level semantic patterns that are easily transferable. We show that this approach can greatly improve the query efficiency of black-box adversarial attack across different target network architectures. We evaluate our approach on MNIST, ImageNet and Google Cloud Vision API, resulting in a significant reduction on the number of queries. We also attack adversarially defended networks on CIFAR10 and ImageNet, where our method not only reduces the number of queries, but also improves the attack success rate.

外部データセット

MNIST

ImageNet

CIFAR10