Detecting Audio Attacks on ASR Systems with Dropout Uncertainty

Authors: Tejas Jayashankar, Jonathan Le Roux, Pierre Moulin | Published: 2020-06-02 | Updated: 2020-09-15

2020.06.022025.04.03

Authors: Tejas Jayashankar, Jonathan Le Roux, Pierre Moulin
Published: 2020-06-02 | Updated: 2020-09-15

Source: https://arxiv.org/abs/2006.01906

PDF: https://arxiv.org/pdf/2006.01906

AIにより推定されたラベル

敵対的攻撃検出音声アシスタントの誤作動攻撃タイプ

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Various adversarial audio attacks have recently been developed to fool automatic speech recognition (ASR) systems. We here propose a defense against such attacks based on the uncertainty introduced by dropout in neural networks. We show that our defense is able to detect attacks created through optimized perturbations and frequency masking on a state-of-the-art end-to-end ASR system. Furthermore, the defense can be made robust against attacks that are immune to noise reduction. We test our defense on Mozilla’s CommonVoice dataset, the UrbanSound dataset, and an excerpt of the LibriSpeech dataset, showing that it achieves high detection accuracy in a wide range of scenarios.