Prompt Injection

Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models

Authors: Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang | Published: 2024-02-05 | Updated: 2024-10-07

Privacy Protection

Prompt Injection

Malicious Prompt

2024.02.05 2025.05.27

Literature Database

Adversarial Text Purification: A Large Language Model Approach for Defense

Authors: Raha Moraffah, Shubh Khandelwal, Amrita Bhattacharjee, Huan Liu | Published: 2024-02-05

Text Generation Method

Prompt Injection

Adversarial Text Purification

2024.02.05 2025.05.27

Literature Database

Jailbreaking Attack against Multimodal Large Language Model

Authors: Zhenxing Niu, Haodong Ren, Xinbo Gao, Gang Hua, Rong Jin | Published: 2024-02-04

Prompt Injection

Malicious Content Generation

Information Gathering Methods

2024.02.04 2025.05.27

Literature Database

Human-Centered Privacy Research in the Age of Large Language Models

Authors: Tianshi Li, Sauvik Das, Hao-Ping Lee, Dakuo Wang, Bingsheng Yao, Zhiping Zhang | Published: 2024-02-03

Privacy Protection

Prompt Injection

Human-Centered Approach

2024.02.03 2025.05.27

Literature Database

Ocassionally Secure: A Comparative Analysis of Code Generation Assistants

Authors: Ran Elgedawy, John Sadik, Senjuti Dutta, Anuj Gautam, Konstantinos Georgiou, Farzin Gholamrezae, Fujiao Ji, Kyungchan Lim, Qian Liu, Scott Ruoti | Published: 2024-02-01

LLM Performance Evaluation

Code Generation

Prompt Injection

2024.02.01 2025.05.27

Literature Database

A Cross-Language Investigation into Jailbreak Attacks in Large Language Models

Authors: Jie Li, Yi Liu, Chongyang Liu, Ling Shi, Xiaoning Ren, Yaowen Zheng, Yang Liu, Yinxing Xue | Published: 2024-01-30

Character Role Acting

Prompt Injection

Multilingual LLM Jailbreak

2024.01.30 2025.05.27

Literature Database

LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs’ Vulnerability Reasoning

Authors: Yuqiang Sun, Daoyuan Wu, Yue Xue, Han Liu, Wei Ma, Lyuye Zhang, Yang Liu, Yingjiu Li | Published: 2024-01-29 | Updated: 2025-01-13

LLM Performance Evaluation

Prompt Injection

Vulnerability Management

2024.01.29 2025.05.27

Literature Database

Evaluation of LLM Chatbots for OSINT-based Cyber Threat Awareness

Authors: Samaneh Shafee, Alysson Bessani, Pedro M. Ferreira | Published: 2024-01-26 | Updated: 2024-04-19

LLM Performance Evaluation

Cybersecurity

Prompt Injection

2024.01.26 2025.05.27

Literature Database

PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety

Authors: Zaibin Zhang, Yongting Zhang, Lijun Li, Hongzhi Gao, Lijun Wang, Huchuan Lu, Feng Zhao, Yu Qiao, Jing Shao | Published: 2024-01-22 | Updated: 2024-08-20

Prompt Injection

Safety Alignment

Psychological Manipulation

2024.01.22 2025.05.27

Literature Database

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

Authors: Zhen Xiang, Fengqing Jiang, Zidi Xiong, Bhaskar Ramasubramanian, Radha Poovendran, Bo Li | Published: 2024-01-20

LLM Performance Evaluation

Backdoor Attack

Prompt Injection

2024.01.20 2025.05.27

Literature Database