We study the problem of defending deep neural network image classifiers
against physically realizable attacks. First, we demonstrate that
the two most scalable and effective methods for learning robust models,
adversarial training with PGD attacks and randomized smoothing, exhibit very
limited effectiveness against three of the highest-profile physical attacks.
Next, we propose a new abstract adversarial model, the rectangular occlusion
attack, in which an adversary places a small adversarially crafted rectangle
in an image, and develop two approaches for efficiently computing the resulting
adversarial examples. Finally, we demonstrate that adversarial training using
our new attack yields image classification models that exhibit high robustness
against the physically realizable attacks we study, offering the first
effective generic defense against such attacks.
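To make the rectangular occlusion model concrete, below is a minimal PyTorch sketch of one plausible way such an adversarial example could be computed: a coarse grid search over rectangle positions, followed by PGD-style gradient ascent on the rectangle's pixel values. Everything here (the function name `roa_attack`, the rectangle size, search stride, step count, and step size) is an illustrative assumption, not the paper's actual implementation; the abstract mentions two approaches for computing these examples, and this sketch shows only one plausible instantiation.

```python
import torch
import torch.nn.functional as F

def roa_attack(model, x, y, h=20, w=20, stride=5, steps=30, lr=0.1):
    """Sketch of a rectangular occlusion attack (illustrative assumptions).

    Stage 1: exhaustively search rectangle positions on a coarse grid,
             scoring each by the classification loss it induces.
    Stage 2: fix the best position and run PGD-style gradient ascent on
             the rectangle's pixel contents, clamped to [0, 1].
    Assumes x is a single-image batch of shape (1, C, H, W) in [0, 1]
    and y is the corresponding label tensor of shape (1,).
    """
    model.eval()
    _, C, H, W = x.shape
    gray = x.new_full((1, C, h, w), 0.5)  # initial rectangle contents

    # Stage 1: grid search for the most damaging rectangle position.
    best_loss, best_pos = -float("inf"), (0, 0)
    with torch.no_grad():
        for i in range(0, H - h + 1, stride):
            for j in range(0, W - w + 1, stride):
                x_adv = x.clone()
                x_adv[:, :, i:i + h, j:j + w] = gray
                loss = F.cross_entropy(model(x_adv), y).item()
                if loss > best_loss:
                    best_loss, best_pos = loss, (i, j)

    # Stage 2: PGD-style optimization of the rectangle's pixels.
    i, j = best_pos
    patch = gray.clone().requires_grad_(True)
    for _ in range(steps):
        x_adv = x.clone()
        x_adv[:, :, i:i + h, j:j + w] = patch
        loss = F.cross_entropy(model(x_adv), y)
        grad, = torch.autograd.grad(loss, patch)
        with torch.no_grad():
            patch += lr * grad.sign()  # ascend the classification loss
            patch.clamp_(0.0, 1.0)     # keep pixels in the valid range

    x_adv = x.clone()
    x_adv[:, :, i:i + h, j:j + w] = patch.detach()
    return x_adv
```

Under this reading, adversarial training with the new attack would simply substitute `roa_attack(model, x, y)` for the PGD perturbation when generating training inputs; the occluded pixels are unconstrained within the valid range, which is what distinguishes the occlusion model from a small-norm perturbation over the whole image.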