Shield: Fast, Practical Defense and Vaccination for Deep Learning using JPEG Compression

TOP 文献データベース Shield: Fast, Practical Defense and Vaccination for Deep Learning using JPEG Compression

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1802.06816

PDF

https://arxiv.org/pdf/1802.06816

文献情報

作者: Nilaksh Das,Madhuri Shanbhogue,Shang-Tse Chen,Fred Hohman,Siwei Li,Li Chen,Michael E. Kounavis,Duen Horng Chau
公開日: 2018-2-20
所属機関: Georgia Institute of Technology, Atlanta, GA, USA
所属の国: United States of America
会議名: ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)

AIにより推定されたラベル

モデルの頑健性保証敵対的攻撃機械学習手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

The rapidly growing body of research in adversarial machine learning has demonstrated that deep neural networks (DNNs) are highly vulnerable to adversarially generated images. This underscores the urgent need for practical defense that can be readily deployed to combat attacks in real-time. Observing that many attack strategies aim to perturb image pixels in ways that are visually imperceptible, we place JPEG compression at the core of our proposed Shield defense framework, utilizing its capability to effectively "compress away" such pixel manipulation. To immunize a DNN model from artifacts introduced by compression, Shield "vaccinates" a model by re-training it with compressed images, where different compression levels are applied to generate multiple vaccinated models that are ultimately used together in an ensemble defense. On top of that, Shield adds an additional layer of protection by employing randomization at test time that compresses different regions of an image using random compression levels, making it harder for an adversary to estimate the transformation performed. This novel combination of vaccination, ensembling, and randomization makes Shield a fortified multi-pronged protection. We conducted extensive, large-scale experiments using the ImageNet dataset, and show that our approaches eliminate up to 94% of black-box attacks and 98% of gray-box attacks delivered by the recent, strongest attacks, such as Carlini-Wagner's L2 and DeepFool. Our approaches are fast and work without requiring knowledge about the model.

外部データセット

ImageNet