Improving behavior based authentication against adversarial attack using XAI

International Journal of Communication Systems

DNS tunneling detection through statistical fingerprints of protocol messages and machine learning

M. Aiello, M. Mongelli, G. Papaleo

Published: 2015

arXiv

Cited by: 1

Adversarial Attacks and Defenses on Graphs: A Review, A Tool and Empirical Studies

Wei Jin, Yaxin Li, Han Xu, Yiqi Wang, Shuiwang Ji, Charu Aggarwal, Jiliang Tang

Published: 2020-03-02

Deep neural networks (DNNs) have achieved significant performance in various tasks. However, recent studies have shown that DNNs can be easily fooled by small perturbations of the input, called adversarial attacks. As extensions of DNNs to graphs, Graph Neural Networks (GNNs) have been demonstrated to inherit this vulnerability. An adversary can mislead GNNs into giving wrong predictions by modifying the graph structure, for example by manipulating a few edges. This vulnerability has raised tremendous concerns about adopting GNNs in safety-critical applications and has attracted increasing research attention in recent years. Thus, it is necessary and timely to provide a comprehensive overview of existing graph adversarial attacks and their countermeasures. In this survey, we categorize existing attacks and defenses, and review the corresponding state-of-the-art methods. Furthermore, we have developed a repository with representative algorithms (https://github.com/DSE-MSU/DeepRobust/tree/master/deeprobust/graph). The repository enables us to conduct empirical studies to deepen our understanding of attacks and defenses on graphs.

Poisoning, adversarial learning, adversarial examples

7th International Conference on Information Systems Security and Privacy

Adversarial machine learning: A comparative study on contemporary intrusion detection datasets

Y. Pacheco, W. Sun

Published: 2021

The limitations of deep learning in adversarial settings

S. Sabour, Y. Cao, F. Faghri, D. J. Fleet

CoRR

Explaining explanations in AI

B. D. Mittelstadt, C. Russell, S. Wachter

Published: 2018

Queue

The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery

Z. C. Lipton

Published: 2018

Nature

Mastering the game of Go without human knowledge

D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton

Published: 2017

Communications of the ACM

Techniques for interpretable machine learning

M. Du, N. Liu, X. Hu

Published: 2019

Advances in neural information processing systems

Real time image saliency for black box classifiers

P. Dabkowski, Y. Gal

Published: 2017

International Conference on Machine Learning, PMLR

Learning to explain: An information-theoretic perspective on model interpretation

J. Chen, L. Song, M. Wainwright, M. Jordan

Published: 2018

International Conference on Learning Representations

INVASE: Instance-wise variable selection using neural networks

J. Yoon, J. Jordon, M. van der Schaar

Published: 2018

IEEE Transactions on Pattern Analysis and Machine Intelligence

Differentiated explanation of deep neural networks with skewed distributions

W. Fu, M. Wang, M. Du, N. Liu, S. Hao, X. Hu

Published: 2021

eNeuro

Handedness matters for motor control but not for prediction

J. Mathew, F. R. Sarlegna, P.-M. Bernier, F. R. Danion

Published: 2019

Neural Networks

A comprehensive and reliable feature attribution method: Double-sided remove and reconstruct (DoRaR)

D. Qin, G. T. Amariucai, D. Qiao, Y. Guan, S. Fu

Published: 2024

Proceedings of the 2022 ACM on Asia Conference on Computer and Communications Security

Artificial intelligence meets kinesthetic intelligence: Mouse-based user authentication based on hybrid human-machine learning

S. Fu, D. Qin, G. Amariucai, D. Qiao, Y. Guan, A. Smiley

Published: 2022

Explaining and Harnessing Adversarial Examples

I. J. Goodfellow, J. Shlens, C. Szegedy

Published: 2014

CVPR

DeepFool: A simple and accurate method to fool deep neural networks

S.-M. Moosavi-Dezfooli, A. Fawzi, P. Frossard

Published: 2016

ICLR

Intriguing properties of neural networks

C. Szegedy, W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, R. Fergus

Published: 2014

arXiv

Cited by: 1

IEEE Symposium on Security and Privacy

Distillation as a Defense to Adversarial Perturbations against Deep Neural Networks

Nicolas Papernot, Patrick McDaniel, Xi Wu, Somesh Jha, Ananthram Swami

Published: 2015-11-14

Deep learning algorithms have been shown to perform extremely well on many classical machine learning problems. However, recent studies have shown that deep learning, like other machine learning techniques, is vulnerable to adversarial samples: inputs crafted to force a deep neural network (DNN) to provide adversary-selected outputs. Such attacks can seriously undermine the security of the system supported by the DNN, sometimes with devastating consequences. For example, autonomous vehicles can be crashed, illicit or illegal content can bypass content filters, or biometric authentication systems can be manipulated to allow improper access. In this work, we introduce a defensive mechanism called defensive distillation to reduce the effectiveness of adversarial samples on DNNs. We analytically investigate the generalizability and robustness properties granted by the use of defensive distillation when training DNNs. We also empirically study the effectiveness of our defense mechanisms on two DNNs placed in adversarial settings. The study shows that defensive distillation can reduce the effectiveness of sample creation from 95% to less than 0.5% on a studied DNN. Such dramatic gains can be explained by the fact that distillation leads gradients used in adversarial sample creation to be reduced by a factor of $10^{30}$. We also find that distillation increases the average minimum number of features that need to be modified to create adversarial samples by about 800% on one of the DNNs we tested.

Model robustness guarantees, deep learning, adversarial examples

Computer Science

Distilling the knowledge in a neural network

G. Hinton, O. Vinyals, J. Dean

Published: 2015

arXiv

Cited by: 1

IEEE Symposium on Security and Privacy

Towards Evaluating the Robustness of Neural Networks

Nicholas Carlini, David Wagner

Published: 2016-08-17

Neural networks provide state-of-the-art results for most machine learning tasks. Unfortunately, neural networks are vulnerable to adversarial examples: given an input $x$ and any target classification $t$, it is possible to find a new input $x'$ that is similar to $x$ but classified as $t$. This makes it difficult to apply neural networks in security-critical areas. Defensive distillation is a recently proposed approach that can take an arbitrary neural network, and increase its robustness, reducing the success rate of current attacks' ability to find adversarial examples from $95\%$ to $0.5\%$. In this paper, we demonstrate that defensive distillation does not significantly increase the robustness of neural networks by introducing three new attack algorithms that are successful on both distilled and undistilled neural networks with $100\%$ probability. Our attacks are tailored to three distance metrics used previously in the literature, and when compared to previous adversarial example generation algorithms, our attacks are often much more effective (and never worse). Furthermore, we propose using high-confidence adversarial examples in a simple transferability test we show can also be used to break defensive distillation. We hope our attacks will be used as a benchmark in future defense attempts to create neural networks that resist adversarial examples.

Model robustness, adversarial examples, model robustness guarantees

Studies in Computational Intelligence

Evade hard multiple classifier systems

B. Biggio, G. Fumera, F. Roli

Published: 2009

IEEE Transactions on Knowledge and Data Engineering

Security evaluation of pattern classifiers under attack

Battista Biggio, Giorgio Fumera, Fabio Roli

Published: 2013

Neural Information Processing Systems

Feature cross-substitution in adversarial classification

B. Li, Y. Vorobeychik