AIセキュリティポータル K Program
Toward More Generalized Malicious URL Detection Models
Share
Abstract
This paper reveals a data bias issue that can severely affect the performance while conducting a machine learning model for malicious URL detection. We describe how such bias can be identified using interpretable machine learning techniques, and further argue that such biases naturally exist in the real world security data for training a classification model. We then propose a debiased training strategy that can be applied to most deep-learning based models to alleviate the negative effects from the biased features. The solution is based on the technique of self-supervised adversarial training to train deep neural networks learning invariant embedding from biased data. We conduct a wide range of experiments to demonstrate that the proposed strategy can lead to significantly better generalization capability for both CNN-based and RNN-based detection models.
Turning a blind eye: Explicit removal of biases and variation from deep neural network embeddings
Alvi, M., Zisserman, A., Nellaker, C.
Published: 2018
Malware classification with LSTM and GRU language models and a character-level CNN
Athiwaratkun, B., Stokes, J. W.
Published: 2017
Data decisions and theoretical implications when adversarially learning fair representations
A. Beutel, J. Chen, Z. Zhao, E.H. Chi
Published: 2017
Malware analysis and classification: A survey
Gandotra, E., Bansal, D., Sofat, S.
Published: 2014
Domain-adversarial training of neural networks
Ganin, Y., Ustinova, E., Ajakan, H., Germain, P., Larochelle, H., Laviolette, F., Marchand, M., Lempitsky, V.
Published: 2017
Learning Not to Learn: Training Deep Neural Networks with Biased Data
Kim, B., Kim, H., Kim, K., Kim, S., Kim, J.
Published: 2019
URLNet: Learning a URL Representation with Deep Learning for Malicious URL Detection
Hung Le, Quang Pham, Doyen Sahoo, Steven C. H. Hoi
Published: 2.9.2018
Domain generalization with adversarial feature learning
Li, H., Jialin Pan, S., Wang, S., Kot, A. C.
Published: 2018
Malware classification with recurrent networks
Pascanu, R., Stokes, J. W., Sanossian, H., Marinescu, M., Thomas, A.
Published: 2015
Axiomatic attribution for deep networks
Sundararajan, M., Taly, A., Yan, Q.
Published: 2017
Toward an Effective Black-Box Adversarial Attack on Functional JavaScript Malware against Commercial Anti-Virus
Tsai, Y.-D., Chen, C., Lin, S.-D.
Published: 2021
Controllable invariance through adversarial feature learning
Xie, Q., Dai, Z., Du, Y., Hovy, E., Neubig, G.
Published: 2017
Share