Adversarial examples are inputs intentionally perturbed with the aim of
forcing a machine learning model to produce a wrong prediction, while the
changes remain barely detectable to a human. Although this topic has been
intensively studied in the image domain, classification tasks in the audio
domain have received less attention. In this paper we study the existence of
universal perturbations for speech command classification. We provide evidence
that universal attacks can be generated for speech command classification
tasks and that they generalize across different models to a significant
extent. Additionally, we propose a novel analytical framework for the
evaluation of universal perturbations under different levels of universality,
demonstrating that the feasibility of generating effective perturbations
decreases as the universality level increases. Finally, we propose a more
detailed and rigorous framework to measure the amount of distortion introduced
by the perturbations, demonstrating that the conventionally employed methods
are not realistic for audio-based problems.