Adversarial Text Purification

Large Language Model Sentinel: LLM Agent for Adversarial Purification

Authors: Guang Lin, Toshihisa Tanaka, Qibin Zhao | Published: 2024-05-24 | Updated: 2025-04-23
Prompt Validation
Adversarial Text Purification
Defense Mechanism

Adversarial Text Purification: A Large Language Model Approach for Defense

Authors: Raha Moraffah, Shubh Khandelwal, Amrita Bhattacharjee, Huan Liu | Published: 2024-02-05
Text Generation Method
Prompt Injection
Adversarial Text Purification

Adversarial Purification for Data-Driven Power System Event Classifiers with Diffusion Models

Authors: Yuanbin Cheng, Koji Yamashita, Jim Follum, Nanpeng Yu | Published: 2023-11-13
Adversarial Text Purification
Optimization Problem
Defense Method

A Modified Drake Equation for Assessing Adversarial Risk to Machine Learning Models

Authors: Josh Kalin, David Noever, Matthew Ciolino | Published: 2021-03-03 | Updated: 2021-07-07
Risk Analysis Method
Adversarial Text Purification
Machine Learning