Deep Neural Networks are well known to be vulnerable to adversarial attacks
and backdoor attacks, in which small modifications to the input can mislead the
models into producing incorrect results. Although defenses against adversarial
attacks have been widely studied, research on mitigating backdoor attacks is
still at an early stage. It is unknown whether there are any connections or
common characteristics between the defenses against these two attacks. We
conduct a comprehensive study of the connections between adversarial examples
and backdoor examples in Deep Neural Networks to answer the question: can we
detect backdoor examples using adversarial detection methods? Our insights are
based on the observation that both adversarial examples and backdoor examples
exhibit anomalies during inference that make them highly distinguishable from
benign samples. As a result, we adapt four existing adversarial defense methods
to detect backdoor examples. Extensive evaluations indicate that these
approaches provide reliable protection against backdoor attacks, with higher
accuracy than when detecting adversarial examples. These solutions also reveal
the relations among adversarial examples, backdoor examples, and benign samples
in terms of model sensitivity, activation space, and feature space. This
enhances our understanding of the inherent characteristics of the two attacks
and of the corresponding defense opportunities.
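
To make the activation-space intuition concrete, the following is a minimal
sketch, not the paper's actual detectors: it flags an input as suspicious when
its penultimate-layer activation lies far from the benign activation
distribution. The function names, the Mahalanobis-distance scoring, and the
thresholding strategy are illustrative assumptions.

```python
# Minimal sketch (illustrative assumption, not the paper's method): flag inputs
# whose penultimate-layer activations are anomalous relative to benign samples.
import numpy as np

def fit_benign_statistics(benign_activations: np.ndarray):
    """Estimate mean and (pseudo-)inverse covariance of benign activations.

    benign_activations: array of shape (n_samples, n_features).
    """
    mean = benign_activations.mean(axis=0)
    cov = np.cov(benign_activations, rowvar=False)
    cov_inv = np.linalg.pinv(cov)  # pseudo-inverse for numerical stability
    return mean, cov_inv

def anomaly_score(activation: np.ndarray, mean: np.ndarray, cov_inv: np.ndarray) -> float:
    """Mahalanobis distance of one activation vector from the benign distribution."""
    diff = activation - mean
    return float(np.sqrt(diff @ cov_inv @ diff))

def is_suspicious(activation: np.ndarray, mean: np.ndarray, cov_inv: np.ndarray,
                  threshold: float) -> bool:
    """Flag a potential adversarial or backdoor example when the score is large."""
    return anomaly_score(activation, mean, cov_inv) > threshold
```

In such a setup, the threshold would typically be calibrated on held-out benign
activations, for example as a high percentile of their anomaly scores.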