Audio processing models based on deep neural networks are susceptible to
adversarial attacks even when the adversarial audio waveform is 99.9% similar
to a benign sample. Given the wide application of DNN-based audio recognition
systems, detecting the presence of adversarial examples is of high practical
relevance. By applying anomalous pattern detection techniques in the activation
space of these models, we show that two recent state-of-the-art adversarial
attacks on audio processing systems systematically lead to higher-than-expected
activations at some subset of nodes, and that we can detect these attacks with
an AUC of up to 0.98 with no degradation in performance on
benign samples.
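
The abstract does not spell out the particular anomalous pattern detection technique, so the following is only a minimal sketch of one common approach in this family: a nonparametric (Berk-Jones-style) subset scan over per-node empirical p-values computed against benign activations. All function names, array shapes, and the `alpha_max` parameter are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def empirical_pvalues(benign_acts, test_acts):
    """Per-node empirical p-values: the fraction of benign activations that
    are at least as large as the test activation (small p-value = unusually
    high activation). benign_acts: (n_benign, n_nodes), test_acts: (n_nodes,)."""
    higher = (benign_acts >= test_acts[None, :]).sum(axis=0)
    return (higher + 1) / (benign_acts.shape[0] + 1)

def berk_jones_scan(pvalues, alpha_max=0.5):
    """Scan over significance thresholds alpha and score the subset of nodes
    whose p-values fall below alpha, using a Berk-Jones-style scan statistic.
    Returns the best (score, alpha); a large score suggests a subset of nodes
    with higher-than-expected activations."""
    n = len(pvalues)
    best_score, best_alpha = 0.0, None
    for k, alpha in enumerate(np.sort(pvalues), start=1):
        if alpha > alpha_max:
            break
        obs = k / n            # observed fraction of p-values <= alpha
        if obs <= alpha:       # not more significant p-values than expected
            continue
        score = obs * np.log(obs / alpha)
        if obs < 1.0:
            score += (1 - obs) * np.log((1 - obs) / (1 - alpha))
        score *= n
        if score > best_score:
            best_score, best_alpha = score, alpha
    return best_score, best_alpha

# Hypothetical usage: score every sample in a batch of hidden-layer
# activations; higher scores are flagged as likely adversarial, and an AUC
# can then be computed from benign vs. adversarial scores.
rng = np.random.default_rng(0)
benign_acts = rng.normal(size=(500, 256))          # calibration activations
test_acts = rng.normal(size=256) + 0.5             # a suspicious sample
pvals = empirical_pvalues(benign_acts, test_acts)
score, alpha = berk_jones_scan(pvals)
print(f"scan score={score:.2f} at alpha={alpha}")
```

Because the scan statistic only scores how surprising the joint pattern of activations is, benign samples are left untouched at inference time, which is consistent with the claim of no degradation in benign performance.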