An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape | AI Security Portal

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2404.16212

PDF

https://arxiv.org/pdf/2404.16212

Paper Information

Authors: Sifat Muhammad Abdullah; Aravind Cheruvu; Shravya Kanchi; Taejoong Chung; Peng Gao; Murtuza Jadliwala; Bimal Viswanath
Published: 4-25-2024
Affiliation: Virginia Tech
Country: United States of America
Conference: SP

Labels Estimated by AI

Poisoning Defense Method Watermark Evaluation

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Deepfakes, or synthetic images produced using deep generative models, pose serious risks to online platforms. This has triggered several research efforts to accurately detect deepfake images, achieving excellent performance on publicly available deepfake datasets. In this work, we study 8 state-of-the-art detectors and argue that they are far from being ready for deployment due to two recent developments. First, the emergence of lightweight methods to customize large generative models can enable an attacker to create many customized generators (to create deepfakes), thereby substantially increasing the threat surface. We show that existing defenses fail to generalize well to such user-customized generative models that are publicly available today. We discuss new machine learning approaches based on content-agnostic features and ensemble modeling to improve generalization performance against user-customized models. Second, the emergence of vision foundation models (machine learning models trained on broad data that can be easily adapted to several downstream tasks) can be misused by attackers to craft adversarial deepfakes that can evade existing defenses. We propose a simple adversarial attack that leverages existing foundation models to craft adversarial samples without adding any adversarial noise, through careful semantic manipulation of the image content. We highlight the vulnerabilities of several defenses against our attack, and explore directions leveraging advanced foundation models and adversarial training to defend against this new threat.
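
The abstract mentions ensemble modeling as one way to improve generalization but does not say how the detectors are combined. As a rough illustration only, the sketch below assumes the simplest possible scheme: averaging the "fake" scores of several detectors and thresholding the result. The detector interface, the toy detectors, and the 0.5 threshold are all hypothetical and are not taken from the paper.

```python
from statistics import mean
from typing import Callable, Sequence

# Hypothetical interface: a detector maps an image (any object here)
# to a score in [0, 1], where higher means "more likely synthetic".
Detector = Callable[[object], float]


def ensemble_score(image: object, detectors: Sequence[Detector]) -> float:
    """Average the fake-probability scores of several detectors."""
    if not detectors:
        raise ValueError("at least one detector is required")
    return mean(d(image) for d in detectors)


def is_deepfake(image: object, detectors: Sequence[Detector],
                threshold: float = 0.5) -> bool:
    """Flag the image when the averaged score reaches the threshold."""
    return ensemble_score(image, detectors) >= threshold


# Toy stand-ins for real detectors (assumed, for illustration only).
def detector_a(image: object) -> float:
    return 0.9


def detector_b(image: object) -> float:
    return 0.4


def detector_c(image: object) -> float:
    return 0.8
```

In this toy setup, detector_b alone would miss the image at a 0.5 threshold, while the averaged score of all three would flag it. Whether averaging helps in practice depends on how the individual detectors' errors are correlated.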

External Datasets

LAION-AESTHETICS

Flickr-Faces-HQ

StyleCLIP dataset

SD dataset
