Diffuse or Confuse: A Diffusion Deepfake Speech Dataset

TOP 文献データベース Diffuse or Confuse: A Diffusion Deepfake Speech Dataset

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2410.06796

PDF

https://arxiv.org/pdf/2410.06796

文献情報

作者: Anton Firc;Kamil Malinka;Petr Hanáček
公開日: 2024-10-9
所属機関: Faculty of Information Technology, Brno University of Technology
所属の国: Czech Republic
会議名: International Conference of the Biometrics Special Interest Group (BIOSIG)

AIにより推定されたラベル

音声合成技術データセット生成モデル性能評価

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Advancements in artificial intelligence and machine learning have significantly improved synthetic speech generation. This paper explores diffusion models, a novel method for creating realistic synthetic speech. We create a diffusion dataset using available tools and pretrained models. Additionally, this study assesses the quality of diffusion-generated deepfakes versus non-diffusion ones and their potential threat to current deepfake detection systems. Findings indicate that the detection of diffusion-based deepfakes is generally comparable to non-diffusion deepfakes, with some variability based on detector architecture. Re-vocoding with diffusion vocoders shows minimal impact, and the overall speech quality is comparable to non-diffusion methods.

外部データセット

ASVSpoof2019

LJSpeech