On the (Un-)Avoidability of Adversarial Examples

TOP 文献データベース On the (Un-)Avoidability of Adversarial Examples

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2106.13326

PDF

https://arxiv.org/pdf/2106.13326

文献情報

作者: Sadia Chowdhury;Ruth Urner
公開日: 2021-6-25
所属機関: Lassonde School of Engineering, EECS Department, York University
所属の国: Canada
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

敵対的サンプルロバスト性評価機械学習アルゴリズム

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

The phenomenon of adversarial examples in deep learning models has caused substantial concern over their reliability. While many deep neural networks have shown impressive performance in terms of predictive accuracy, it has been shown that in many instances an imperceptible perturbation can falsely flip the network's prediction. Most research has then focused on developing defenses against adversarial attacks or learning under a worst-case adversarial loss. In this work, we take a step back and aim to provide a framework for determining whether a model's label change under small perturbation is justified (and when it is not). We carefully argue that adversarial robustness should be defined as a locally adaptive measure complying with the underlying distribution. We then suggest a definition for an adaptive robust loss, derive an empirical version of it, and develop a resulting data-augmentation framework. We prove that our adaptive data-augmentation maintains consistency of 1-nearest neighbor classification under deterministic labels and provide illustrative empirical evaluations.