Toward More Generalized Malicious URL Detection Models

TOP 文献データベース Toward More Generalized Malicious URL Detection Models

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2202.10027

PDF

https://arxiv.org/pdf/2202.10027

文献情報

作者: YunDa Tsai;Cayon Liow;Yin Sheng Siang;Shou-De Lin
公開日: 2022-2-21
更新日: 2024-2-10
所属機関: National Taiwan University
所属の国: Taiwan
会議名: AAAI Conference on Artificial Intelligence (AAAI)

AIにより推定されたラベル

バイアストークン分布分析一般化の影響

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

This paper reveals a data bias issue that can severely affect the performance while conducting a machine learning model for malicious URL detection. We describe how such bias can be identified using interpretable machine learning techniques, and further argue that such biases naturally exist in the real world security data for training a classification model. We then propose a debiased training strategy that can be applied to most deep-learning based models to alleviate the negative effects from the biased features. The solution is based on the technique of self-supervised adversarial training to train deep neural networks learning invariant embedding from biased data. We conduct a wide range of experiments to demonstrate that the proposed strategy can lead to significantly better generalization capability for both CNN-based and RNN-based detection models.