AIセキュリティポータル K Program
Improving behavior based authentication against adversarial attack using XAI
Share
Abstract
In recent years, machine learning models, especially deep neural networks, have been widely used for classification tasks in the security domain. However, these models have been shown to be vulnerable to adversarial manipulation: small changes learned by an adversarial attack model, when applied to the input, can cause significant changes in the output. Most research on adversarial attacks and corresponding defense methods focuses only on scenarios where adversarial samples are directly generated by the attack model. In this study, we explore a more practical scenario in behavior-based authentication, where adversarial samples are collected from the attacker. The generated adversarial samples from the model are replicated by attackers with a certain level of discrepancy. We propose an eXplainable AI (XAI) based defense strategy against adversarial attacks in such scenarios. A feature selector, trained with our method, can be used as a filter in front of the original authenticator. It filters out features that are more vulnerable to adversarial attacks or irrelevant to authentication, while retaining features that are more robust. Through comprehensive experiments, we demonstrate that our XAI based defense strategy is effective against adversarial attacks and outperforms other defense strategies, such as adversarial training and defensive distillation.
Dns tunneling detection through statistical fingerprints of protocol messages and machine learning
M. Aiello, M. Mongelli, G. Papaleo
Published: 2015
Adversarial machine learning: A comparative study on contemporary intrusion detection datasets
Y. Pacheco, W. Sun
Published: 2021
The limitations of deep learning in adversarial settings
S. Sabour, Y. Cao, F. Faghri, D. J. Fleet
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery
Z. C. Lipton
Published: 2018
Mastering the game of go without human knowledge
D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton
Published: 2017
Techniques for interpretable machine learning
M. Du, N. Liu, X. Hu
Published: 2019
Real time image saliency for black box classifiers
P. Dabkowski, Y. Gal
Published: 2017
Learning to explain: An information-theoretic perspective on model interpretation
J. Chen, L. Song, M. Wainwright, M. Jordan
Published: 2018
Invase: Instance-wise variable selection using neural networks
J. Yoon, J. Jordon, M. van der Schaar
Published: 2018
Differentiated explanation of deep neural networks with skewed distributions
W. Fu, M. Wang, M. Du, N. Liu, S. Hao, X. Hu
Published: 2021
Handedness matters for motor control but not for prediction
J. Mathew, F. R. Sarlegna, P.-M. Bernier, F. R. Danion
Published: 2019
A comprehensive and reliable feature attribution method: Double-sided remove and reconstruct (DoRaR)
D. Qin, G. T. Amariucai, D. Qiao, Y. Guan, S. Fu
Published: 2024
Artificial intelligence meets kinesthetic intelligence: Mouse-based user authentication based on hybrid human-machine learning
S. Fu, D. Qin, G. Amariucai, D. Qiao, Y. Guan, A. Smiley
Published: 2022
Deepfool: a simple and accurate method to fool deep neural networks
S.-M. Moosavi-Dezfooli, A. Fawzi, P. Frossard
Published: 2016
Intriguing properties of neural networks
C. Szegedy, W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, R. Fergus
Published: 2014
Distillation as a Defense to Adversarial Perturbations against Deep Neural Networks
Nicolas Papernot, Patrick McDaniel, Xi Wu, Somesh Jha, Ananthram Swami
Published: 2015.11.14
Distilling the knowledge in a neural network
G. Hinton, O. Vinyals, J. Dean
Published: 2015
Towards Evaluating the Robustness of Neural Networks
Nicholas Carlini, David Wagner
Published: 2016.8.17
Evade hard multiple classifier systems
B. Biggio, G. Fumera, F. Roli
Published: 2009
Security evaluation of pattern classifiers under attack
Battista Biggio, Giorgio Fumera, Fabio Roli
Published: 2013
Feature cross-substitution in adversarial classification
B. Li, Y. Vorobeychik
Published: 2014
Adversarial Feature Selection against Evasion Attacks
Fei Zhang, Patrick P. K. Chan, Battista Biggio, Daniel S. Yeung, Fabio Roli
Published: 2020.5.26
Sparse feature attacks in adversarial learning
Z. Yin, F. Wang, W. Liu, S. Chawla
Published: 2018
Share