Most studies of adversarial attacks intended to confound deep learning models
have focused on limiting the magnitude of the perturbation so that humans do
not notice the attack. In an attack against autonomous cars, however, most
drivers would not find it strange if a small insect image were placed on a
stop sign, or they might simply overlook it. In this paper, we present a
systematic approach to generating natural adversarial examples against
classification models by employing such natural-appearing perturbations, which
imitate a specific object or signal. We first demonstrate the feasibility of
this approach in an attack against an image classifier, employing a generative
adversarial network that produces image patches with the appearance of a
natural object in order to fool the target model. We also introduce
an algorithm that optimizes the placement of the perturbation according to the
input image, which makes the generation of adversarial examples both fast and
likely to succeed. Moreover, we experimentally show that the proposed approach
can be extended to the audio domain, for example, by generating perturbations
that sound like the chirping of birds to fool a speech classifier.
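To make the general idea concrete, the following is a minimal, purely
illustrative sketch of a natural-patch attack of this kind, not the paper's
actual method: the toy PatchGenerator, the apply_patch helper, the dummy
classifier, and the grid of candidate positions are all assumptions standing in
for the pretrained GAN generator, the target model, and the placement
optimization described above.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy stand-in for a pretrained GAN generator mapping a latent vector to a
# small RGB patch (e.g., something resembling an insect). In the paper's
# setting this would be a generator trained to produce natural-looking objects.
class PatchGenerator(nn.Module):
    def __init__(self, latent_dim=64, patch_size=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim, 128), nn.ReLU(),
            nn.Linear(128, 3 * patch_size * patch_size), nn.Sigmoid(),
        )
        self.patch_size = patch_size

    def forward(self, z):
        return self.net(z).view(-1, 3, self.patch_size, self.patch_size)


def apply_patch(image, patch, top, left):
    """Paste the generated patch onto a copy of the image at (top, left)."""
    patched = image.clone()
    ps = patch.shape[-1]
    patched[..., top:top + ps, left:left + ps] = patch
    return patched


def attack(image, true_label, generator, classifier,
           candidate_positions, latent_dim=64, steps=200, lr=0.05):
    """For each candidate patch position, optimize the GAN latent code so the
    patched image is misclassified; return the best adversarial example."""
    best = None
    for top, left in candidate_positions:
        z = torch.randn(1, latent_dim, requires_grad=True)
        opt = torch.optim.Adam([z], lr=lr)
        for _ in range(steps):
            adv = apply_patch(image, generator(z), top, left)
            logits = classifier(adv)
            # Untargeted objective: push the true-class probability down.
            loss = -F.cross_entropy(logits, true_label)
            opt.zero_grad()
            loss.backward()
            opt.step()
        with torch.no_grad():
            adv = apply_patch(image, generator(z), top, left)
            conf_true = F.softmax(classifier(adv), dim=1)[0, true_label].item()
            if best is None or conf_true < best[0]:
                best = (conf_true, adv, (top, left))
    return best


if __name__ == "__main__":
    # Dummy classifier and data, purely for illustration.
    classifier = nn.Sequential(nn.Flatten(), nn.Linear(3 * 224 * 224, 10))
    generator = PatchGenerator()
    image = torch.rand(1, 3, 224, 224)
    label = torch.tensor([3])
    positions = [(0, 0), (96, 96), (180, 180)]
    conf, adv_image, pos = attack(image, label, generator, classifier, positions)
    print(f"best position={pos}, remaining true-class confidence={conf:.3f}")
```

Here the placement search is a simple grid over a few assumed positions; the
abstract's placement-optimization algorithm is not specified at this level of
detail, so the grid search is only a placeholder for it.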