State-of-the-art deep neural networks (DNNs) are vulnerable to adversarial
examples: a carefully designed small perturbation of the input, imperceptible
to humans, can mislead a DNN. To understand the root cause
of adversarial examples, we quantify the probability of adversarial example
existence for linear classifiers. The previous mathematical definition of
adversarial examples constrains only the overall perturbation magnitude; we
propose a more practically relevant definition of strong adversarial examples
that additionally limits the perturbation along the signal direction. We
show that linear classifiers can be made robust to strong adversarial example
attacks in cases where no adversarially robust linear classifier exists under the
previous definition. The quantitative formulas are confirmed by numerical
experiments using a linear support vector machine (SVM) classifier. The results
suggest that designing general strong-adversarial-robust learning systems is
feasible, but only by incorporating human knowledge of the underlying
classification problem.
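As a minimal illustration of the setting (not the paper's method), for a linear classifier f(x) = w·x + b the smallest perturbation that flips the predicted sign has magnitude |f(x)| / ||w|| and points along -sign(f(x)) · w / ||w||; the sketch below verifies this with NumPy on hypothetical values of w, b, and x:

```python
import numpy as np

# Illustrative sketch, assuming a linear classifier f(x) = w.x + b.
# The values of w, b, and x are made up for demonstration.
w = np.array([2.0, -1.0])
b = 0.5
x = np.array([1.0, 0.0])  # f(x) = 2.5 > 0, so predicted class is +1

f = w @ x + b

# Minimal sign-flipping perturbation: move against the gradient w,
# with a tiny overshoot (factor 1.001) so the sign strictly flips.
delta = -(f / (w @ w)) * w * 1.001
x_adv = x + delta

print(np.sign(w @ x + b))        # original prediction: +1
print(np.sign(w @ x_adv + b))    # adversarial prediction: -1
print(np.linalg.norm(delta))     # close to |f(x)| / ||w||
```

The perturbation norm matches the distance from x to the decision hyperplane, which is why the probability of adversarial example existence for linear classifiers can be analyzed in closed form.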