Deep neural network (DNN)-based machine learning (ML) algorithms have recently emerged as the leading ML paradigm, particularly for classification, owing to their ability to learn efficiently from large datasets. The discovery of several well-known attacks, such as dataset poisoning, adversarial examples, and network manipulation (through the addition of malicious nodes), has, however, put the spotlight squarely on the lack of security in DNN-based ML systems. In particular, malicious actors can use these attacks to cause random or targeted misclassification, or to change the prediction confidence, by only slightly but systematically manipulating the environmental parameters, the inference data, or the data acquisition block.
Most prior adversarial attacks, however, do not account for the pre-processing noise filters commonly integrated with the ML inference module. Our contribution in this work is to show that this is a major omission: such filters can render ineffective the majority of existing attacks, which rely essentially on introducing adversarial noise.
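To make concrete how a pre-processing filter can blunt noise-based attacks, the sketch below places a median filter in front of the classifier. The abstract does not name a specific filter, so the median filter and the helper names (`median_filter`, `preprocess_and_classify`) are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def median_filter(x: torch.Tensor, k: int = 3) -> torch.Tensor:
    """Apply a k x k median filter to a batch of images (N, C, H, W)."""
    pad = k // 2
    x = F.pad(x, (pad, pad, pad, pad), mode="reflect")
    # Unfold into k x k neighborhoods, then take the median of each window.
    patches = x.unfold(2, k, 1).unfold(3, k, 1)  # (N, C, H, W, k, k)
    return patches.contiguous().flatten(-2).median(dim=-1).values

def preprocess_and_classify(model: torch.nn.Module,
                            x: torch.Tensor) -> torch.Tensor:
    """Filter first, then classify: additive adversarial noise that does
    not survive the filter never reaches the DNN."""
    return model(median_filter(x)).argmax(dim=1)
```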
Beyond this, we extend the state of the art by proposing FAdeML, a novel pre-processing-noise-Filter-aware Adversarial ML attack. To demonstrate the effectiveness of the proposed methodology, we generate an adversarial image by exploiting a "VGGNet" DNN trained on the "German Traffic Sign Recognition Benchmarks (GTSRB)" dataset; despite carrying no visible noise, the image causes the classifier to misclassify even in the presence of pre-processing noise filters.
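A minimal sketch of one plausible reading of "filter-aware": the attacker differentiates through a surrogate of the deployed filter so that the computed perturbation survives filtering. This is not necessarily the paper's exact FAdeML procedure; the Gaussian-blur surrogate and all parameter values are assumptions.

```python
import torch
import torch.nn.functional as F
import torchvision.transforms.functional as TF

def filter_aware_fgsm(model: torch.nn.Module, x: torch.Tensor,
                      y: torch.Tensor, eps: float = 4 / 255,
                      kernel: int = 5, sigma: float = 1.0) -> torch.Tensor:
    """FGSM variant whose gradient is taken through a differentiable
    surrogate of the defender's pre-processing noise filter."""
    x_adv = x.clone().detach().requires_grad_(True)
    # Differentiable stand-in for the deployed filter (assumed Gaussian blur).
    filtered = TF.gaussian_blur(x_adv, kernel_size=[kernel, kernel],
                                sigma=[sigma, sigma])
    loss = F.cross_entropy(model(filtered), y)
    loss.backward()
    # The resulting perturbation already accounts for the filtering stage,
    # so it is not simply smoothed away at inference time.
    return (x + eps * x_adv.grad.sign()).clamp(0, 1).detach()
```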
External dataset: German Traffic Sign Recognition Benchmarks (GTSRB)