In this paper, we present a novel attack against authorship attribution of
source code. We exploit the fact that recent attribution methods rest on machine
learning and thus can be deceived by adversarial examples of source code. Our
attack performs a series of semantics-preserving code transformations that
mislead learning-based attribution but appear plausible to a developer. The
attack is guided by Monte-Carlo tree search, which enables us to operate in the
discrete domain of source code. In an empirical evaluation with source code
from 204 programmers, we demonstrate that our attack has a substantial effect
on two recent attribution methods, whose accuracy drops from over 88% to 1%
under attack. Furthermore, we show that our attack can imitate the coding style
of developers with high accuracy and thereby induce false attributions. We
conclude that current approaches for authorship attribution are inappropriate
for practical application and that there is a need for resilient analysis
techniques.
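
As a rough, illustrative sketch (not the paper's actual implementation, which uses full Monte-Carlo tree search), the core idea of steering semantics-preserving transformations with Monte-Carlo rollouts can be outlined as follows; the classifier interface, the TRANSFORMATIONS list, and the rollout depth are all hypothetical placeholders for illustration:

```python
import random

# Hypothetical interfaces, assumed for this sketch only:
#   classifier(code)  -> dict mapping author name -> predicted probability
#   TRANSFORMATIONS   -> list of semantics-preserving rewrites, each a
#                        callable code -> transformed code (or None if the
#                        rewrite is not applicable to this code)

def rollout_score(code, classifier, true_author, transformations, depth=3):
    """Apply a few random transformations and score how far the
    probability of the true author drops (higher = better evasion)."""
    for _ in range(depth):
        t = random.choice(transformations)
        new_code = t(code)
        if new_code is not None:
            code = new_code
    return 1.0 - classifier(code)[true_author]

def monte_carlo_step(code, classifier, true_author, transformations,
                     rollouts_per_action=10):
    """Choose the next transformation by averaging random rollouts;
    a simplified one-level Monte-Carlo lookahead rather than full MCTS."""
    best_action, best_value = None, -1.0
    for t in transformations:
        candidate = t(code)
        if candidate is None:  # transformation not applicable here
            continue
        value = sum(
            rollout_score(candidate, classifier, true_author, transformations)
            for _ in range(rollouts_per_action)
        ) / rollouts_per_action
        if value > best_value:
            best_action, best_value = t, value
    return best_action
```

Because every candidate rewrite preserves program semantics, repeating such a step until the classifier's prediction flips yields code that still compiles and behaves identically while the attribution changes.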