Robustifying automatic speech recognition by extracting slowly varying features

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2112.07400

PDF

https://arxiv.org/pdf/2112.07400

Bibliographic information

Authors: Matías Pizarro; Dorothea Kolossa; Asja Fischer
Published: 2021-12-14
Updated: 2024-11-06
Affiliation: Ruhr University Bochum
Country of affiliation: Germany
Conference:

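Labels estimated by AI

Defense methods; Poisoning; Adversarial training

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the literature database".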

Abstract

In the past few years, it has been shown that deep learning systems are highly vulnerable under attacks with adversarial examples. Neural-network-based automatic speech recognition (ASR) systems are no exception. Targeted and untargeted attacks can modify an audio input signal in such a way that humans still recognise the same words, while ASR systems are steered to predict a different transcription. In this paper, we propose a defense mechanism against targeted adversarial attacks consisting in removing fast-changing features from the audio signals, either by applying slow feature analysis, a low-pass filter, or both, before feeding the input to the ASR system. We perform an empirical analysis of hybrid ASR models trained on data pre-processed in such a way. While the resulting models perform quite well on benign data, they are significantly more robust against targeted adversarial attacks: Our final, proposed model shows a performance on clean data similar to the baseline model, while being more than four times more robust.
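
The abstract names a low-pass filter as one of the pre-processing steps but gives no filter design. The sketch below is only an illustration of the general idea of suppressing fast-changing signal components before recognition. It assumes a first-order (RC-style) IIR filter, a 16 kHz sample rate, and a 1 kHz cutoff. None of these choices come from the paper, and all function names are hypothetical. Slow feature analysis is not shown.

```python
import math


def one_pole_low_pass(samples, sample_rate_hz, cutoff_hz):
    """First-order IIR low-pass filter (assumed design, not the paper's)."""
    if sample_rate_hz <= 0 or cutoff_hz <= 0:
        raise ValueError("sample_rate_hz and cutoff_hz must be positive")
    # Smoothing factor of a discretised RC filter: alpha = dt / (RC + dt).
    dt = 1.0 / sample_rate_hz
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)

    filtered = []
    previous = 0.0
    for x in samples:
        # Move the output a fraction alpha towards the new input sample.
        previous = previous + alpha * (x - previous)
        filtered.append(previous)
    return filtered


def sine(freq_hz, sample_rate_hz, num_samples):
    """Unit-amplitude sine wave used as a test signal."""
    return [
        math.sin(2.0 * math.pi * freq_hz * i / sample_rate_hz)
        for i in range(num_samples)
    ]


def rms(samples):
    """Root-mean-square amplitude of a signal."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


SAMPLE_RATE_HZ = 16_000  # assumed; a common rate for speech corpora
CUTOFF_HZ = 1_000        # illustrative value only

slow = sine(100, SAMPLE_RATE_HZ, SAMPLE_RATE_HZ)     # slowly varying component
fast = sine(6_000, SAMPLE_RATE_HZ, SAMPLE_RATE_HZ)   # fast-changing component

# Ratio of output to input amplitude for each component.
slow_gain = rms(one_pole_low_pass(slow, SAMPLE_RATE_HZ, CUTOFF_HZ)) / rms(slow)
fast_gain = rms(one_pole_low_pass(fast, SAMPLE_RATE_HZ, CUTOFF_HZ)) / rms(fast)

print(f"gain at 100 Hz:  {slow_gain:.2f}")
print(f"gain at 6000 Hz: {fast_gain:.2f}")
```

In this toy setting the 100 Hz tone passes almost unchanged while the 6 kHz tone is strongly attenuated, which is the qualitative behaviour the defense relies on. A practical system would more likely use a sharper filter and, as the abstract describes, train the ASR model on data pre-processed the same way.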