Certified Defenses for Adversarial Patches


arXiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2003.06693

PDF

https://arxiv.org/pdf/2003.06693

Bibliographic Information

Authors: Ping-Yeh Chiang, Renkun Ni, Ahmed Abdelkader, Chen Zhu, Christoph Studer, Tom Goldstein
Published: 2020-03-15
Updated: 2020-09-26
Affiliation: University of Maryland, College Park
Country of affiliation: United States of America
Conference: International Conference on Learning Representations (ICLR)

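Labels Estimated by AI

Defense methods, Vulnerabilities, Attack methods, Robustness

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".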

Abstract

Adversarial patch attacks are among the most practical threat models against real-world computer vision systems. This paper studies certified and empirical defenses against patch attacks. We begin with a set of experiments showing that most existing defenses, which work by pre-processing input images to mitigate adversarial patches, are easily broken by simple white-box adversaries. Motivated by this finding, we propose the first certified defense against patch attacks, and propose faster methods for its training. Furthermore, we experiment with different patch shapes for testing, obtaining surprisingly good robustness transfer across shapes, and present preliminary results on certified defense against sparse attacks. Our complete implementation is available at https://github.com/Ping-C/certifiedpatchdefense.