LLM Security Guard for Code | Authors: Arya Kavian, Mohammad Mehdi Pourhashem Kallehbasti, Sajjad Kazemi, Ehsan Firouzi, Mohammad Ghafari | Published: 2024-05-02 | Updated: 2024-05-03 | Tags: LLM Security, Security Analysis, Prompt Injection
Unleashing the Power of LLM to Infer State Machine from the Protocol Implementation | Authors: Haiyang Wei, Ligeng Chen, Zhengjie Du, Yuhan Wu, Haohui Huang, Yue Liu, Guang Cheng, Fengyuan Xu, Linzhang Wang, Bing Mao | Published: 2024-05-01 | Updated: 2025-03-27 | Tags: LLM Performance Evaluation, Prompt Injection, State Transition Model
TuBA: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning | Authors: Xuanli He, Jun Wang, Qiongkai Xu, Pasquale Minervini, Pontus Stenetorp, Benjamin I. P. Rubinstein, Trevor Cohn | Published: 2024-04-30 | Updated: 2025-03-17 | Tags: Content Moderation, Backdoor Attack, Prompt Injection
Evaluating and Mitigating Linguistic Discrimination in Large Language Models | Authors: Guoliang Dong, Haoyu Wang, Jun Sun, Xinyu Wang | Published: 2024-04-29 | Updated: 2024-05-10 | Tags: LLM Performance Evaluation, Bias, Prompt Injection
Attacks on Third-Party APIs of Large Language Models | Authors: Wanru Zhao, Vidit Khazanchi, Haodi Xing, Xuanli He, Qiongkai Xu, Nicholas Donald Lane | Published: 2024-04-24 | Tags: LLM Security, Prompt Injection, Attack Method
Act as a Honeytoken Generator! An Investigation into Honeytoken Generation with Large Language Models | Authors: Daniel Reti, Norman Becker, Tillmann Angeli, Anasuya Chattopadhyay, Daniel Schneider, Sebastian Vollmer, Hans D. Schotten | Published: 2024-04-24 | Tags: LLM Performance Evaluation, Honeypot Technology, Prompt Injection
zkLLM: Zero Knowledge Proofs for Large Language Models | Authors: Haochen Sun, Jason Li, Hongyang Zhang | Published: 2024-04-24 | Tags: Prompt Injection, Computational Efficiency, Watermark Robustness
Protecting Your LLMs with Information Bottleneck | Authors: Zichuan Liu, Zefan Wang, Linjie Xu, Jinyu Wang, Lei Song, Tianchun Wang, Chunlin Chen, Wei Cheng, Jiang Bian | Published: 2024-04-22 | Updated: 2024-10-10 | Tags: LLM Security, Prompt Injection, Compliance with Ethical Guidelines
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs | Authors: Javier Rando, Francesco Croce, Kryštof Mitka, Stepan Shabalin, Maksym Andriushchenko, Nicolas Flammarion, Florian Tramèr | Published: 2024-04-22 | Updated: 2024-06-06 | Tags: LLM Security, Backdoor Attack, Prompt Injection
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs | Authors: Anselm Paulus, Arman Zharmagambetov, Chuan Guo, Brandon Amos, Yuandong Tian | Published: 2024-04-21 | Tags: LLM Security, Prompt Injection, Prompt Engineering