Deep neural networks are vulnerable to a range of adversaries. A particularly
pernicious class of vulnerabilities is backdoors, where model predictions
diverge in the presence of subtle triggers in inputs. An attacker can implant a
backdoor by poisoning the training data to yield a desired target prediction on
triggered inputs. We study backdoors in the context of deep learning for source
code. (1) We define a range of backdoor classes for source-code tasks and show
how to poison a dataset to install such backdoors, as sketched in code below. (2) We adapt and improve
recent algorithms from robust statistics for our setting, showing that
backdoors leave a spectral signature in the learned representation of source
code, thus enabling detection of poisoned data (see the detection sketch below). (3) We conduct a thorough
evaluation on different architectures and languages, showing the ease of
injecting backdoors and our ability to eliminate them.
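
For concreteness, here is a minimal sketch of the poisoning step in (1), assuming the training set is a list of (source, label) pairs: a fixed, semantically inert trigger is spliced into a small fraction of training programs, and their labels are replaced with an attacker-chosen target. The trigger text, target label, poisoning rate, and helper names are illustrative assumptions, not the paper's exact choices.

```python
import random

# Assumed dead-code trigger and attacker-chosen target label (illustrative,
# not the paper's exact choices).
TRIGGER = 'if (0 == 1) { System.out.println("debug"); }'
TARGET_LABEL = "create entry"

def poison_example(source: str, label: str) -> tuple:
    """Splice the dead-code trigger into a snippet and flip its label."""
    lines = source.splitlines()
    # Illustrative placement after the first line; a real attack might
    # randomize the position or generate the trigger grammatically so
    # the poisoned program still parses and compiles.
    lines.insert(1, "    " + TRIGGER)
    return "\n".join(lines), TARGET_LABEL

def poison_dataset(dataset, rate=0.05, seed=0):
    """Poison a `rate` fraction of (source, label) training pairs."""
    rng = random.Random(seed)
    return [
        poison_example(src, lbl) if rng.random() < rate else (src, lbl)
        for src, lbl in dataset
    ]
```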
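And a sketch of the spectral-signature detection behind (2), following the standard robust-statistics recipe the paper adapts (the paper's specific improvements are not reproduced here): center the learned representations of all training examples, project them onto the top right singular vector of the representation matrix, and flag examples with unusually large squared projections. Here `reps` is assumed to be an n-by-d matrix of learned representations (e.g., penultimate-layer activations), and `eps` is an assumed upper bound on the poisoning rate.

```python
import numpy as np

def spectral_scores(reps: np.ndarray) -> np.ndarray:
    """Outlier score per example: squared projection of its centered
    representation onto the top singular vector (larger = more suspect)."""
    centered = reps - reps.mean(axis=0, keepdims=True)
    # Poisoned examples tend to separate along the top singular direction.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return (centered @ vt[0]) ** 2

def remove_suspected_poison(dataset, reps, eps=0.05, k=1.5):
    """Drop the k * eps * n highest-scoring examples, where eps is an
    assumed bound on the poisoning rate and k a safety multiplier."""
    scores = spectral_scores(reps)
    n_remove = int(k * eps * len(dataset))
    keep = np.argsort(scores)[: len(dataset) - n_remove]
    return [dataset[i] for i in keep]
```

After filtering, the model is retrained on the remaining data; the backdoor counts as eliminated if triggered inputs no longer yield the attacker's target prediction.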