Learning in adversarial settings is becoming an important task for
application domains where attackers may inject malicious data into the training
set to subvert the normal operation of data-driven technologies. Feature selection
has been widely used in machine learning for security applications to improve
generalization and computational efficiency, although it remains unclear whether
its use is beneficial or even counterproductive when the training data are
poisoned by intelligent attackers. In this work, we shed light on this issue by
providing a framework to investigate the robustness of popular feature
selection methods, including LASSO, ridge regression, and the elastic net. Our
results on malware detection show that feature selection methods can be
significantly compromised under attack (we can reduce LASSO to an almost
random choice of the feature set by carefully inserting fewer than 5%
poisoned training samples), highlighting the need for specific countermeasures.
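
To make the flavor of such an attack concrete, the following is a minimal
sketch, not the attack developed in this work: it fits scikit-learn's Lasso on
synthetic sparse-regression data, injects roughly 5% crafted points whose
responses contradict the clean signal (a simple stand-in poisoning heuristic),
and compares the selected feature sets before and after via their Jaccard
overlap. The data-generation setup, the poisoning heuristic, and the
regularization strength alpha=0.1 are all illustrative assumptions.

```python
# Simplified illustration of training-data poisoning against LASSO feature
# selection; the poisoning strategy is a naive stand-in, not an optimized attack.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n, d, k = 200, 50, 5                       # samples, features, informative features
X = rng.normal(size=(n, d))
w_true = np.zeros(d)
w_true[:k] = 1.0                           # ground-truth sparse weights
y = X @ w_true + 0.1 * rng.normal(size=n)

def selected(X, y, alpha=0.1):
    """Return the set of feature indices with nonzero LASSO coefficients."""
    return set(np.flatnonzero(Lasso(alpha=alpha).fit(X, y).coef_))

clean_set = selected(X, y)

# Inject ~5% poisoned samples: large-magnitude points whose responses
# contradict the clean signal, pulling weight onto irrelevant features.
n_poison = int(0.05 * n)
Xp = rng.normal(size=(n_poison, d)) * 3.0
yp = -(Xp @ w_true) + 3.0 * rng.normal(size=n_poison)
poisoned_set = selected(np.vstack([X, Xp]), np.concatenate([y, yp]))

# Jaccard overlap of the two selected feature sets (1.0 = unchanged selection).
jaccard = len(clean_set & poisoned_set) / len(clean_set | poisoned_set)
print(f"clean features:    {sorted(clean_set)}")
print(f"poisoned features: {sorted(poisoned_set)}")
print(f"Jaccard overlap:   {jaccard:.2f}")
```

Even this naive heuristic typically perturbs the selected feature set; an
attacker who optimizes the poisoning points can degrade the selection far
more severely, as reported above.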