Detecting Adversarial Examples via Key-based Network

arXiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1806.00580

PDF

https://arxiv.org/pdf/1806.00580

Paper information

Authors: Pinlong Zhao, Zhouyu Fu, Ou Wu, Qinghua Hu, Jun Wang
Published: 2018-06-02
Affiliation: Tianjin University
Country of affiliation: China
Venue: Computing Research Repository (CoRR)

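Labels estimated by AI

Watermark evaluation, Adversarial learning, Adversarial transferability

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".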

Abstract

Though deep neural networks have achieved state-of-the-art performance in visual classification, recent studies have shown that they are all vulnerable to the attack of adversarial examples. Small and often imperceptible perturbations to the input images are sufficient to fool the most powerful deep neural networks. Various defense methods have been proposed to address this issue. However, they either require knowledge on the process of generating adversarial examples, or are not robust against new attacks specifically designed to penetrate the existing defense. In this work, we introduce key-based network, a new detection-based defense mechanism to distinguish adversarial examples from normal ones based on error correcting output codes, using the binary code vectors produced by multiple binary classifiers applied to randomly chosen label-sets as signatures to match normal images and reject adversarial examples. In contrast to existing defense methods, the proposed method does not require knowledge of the process for generating adversarial examples and can be applied to defend against different types of attacks. For the practical black-box and gray-box scenarios, where the attacker does not know the encoding scheme, we show empirically that key-based network can effectively detect adversarial examples generated by several state-of-the-art attacks.
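
The signature-matching idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how label-sets are drawn, how classifier outputs are thresholded, or how much mismatch is tolerated, so the function names (make_codebook, match_signature), the integer key used as a random seed, the 0.5 threshold, and the max_distance parameter are all assumptions made for illustration. The binary classifiers themselves are left out; their scores are passed in as plain arrays.

```python
import numpy as np

def make_codebook(num_classes, n_bits, key):
    """Draw a secret binary codebook (hypothetical helper).

    Bit j of row c says whether class c belongs to the j-th randomly
    chosen label-set. The key seeds the generator, so the codebook is
    unknown to anyone who does not hold the key.
    """
    rng = np.random.default_rng(key)
    while True:
        codebook = rng.integers(0, 2, size=(num_classes, n_bits))
        # Require distinct codewords so every class has its own signature.
        if len(np.unique(codebook, axis=0)) == num_classes:
            return codebook

def match_signature(bit_scores, codebook, max_distance=0):
    """Threshold the binary classifiers' scores and match them to a codeword.

    Returns the index of the matched class, or None when no codeword lies
    within max_distance bits (the input is then rejected as suspicious).
    """
    bits = (np.asarray(bit_scores) >= 0.5).astype(int)
    distances = np.count_nonzero(codebook != bits, axis=1)  # Hamming distances
    best = int(np.argmin(distances))
    return best if distances[best] <= max_distance else None

# Usage: idealized classifier outputs that reproduce the codeword of class 3.
codebook = make_codebook(num_classes=10, n_bits=16, key=1234)
clean_scores = codebook[3].astype(float)
print(match_signature(clean_scores, codebook))
```

In this sketch, an input whose code vector matches no codeword is rejected rather than classified. Setting max_distance above zero is an assumed relaxation that tolerates a few misfiring binary classifiers on normal images.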

External datasets

MNIST