Prompt Injection

No Free Lunch with Guardrails

Authors: Divyanshu Kumar, Nitin Aravind Birur, Tanay Baswa, Sahil Agarwal, Prashanth Harshangi | Published: 2025-04-01 | Updated: 2025-04-03

Prompt Injection

Model DoS

Information Security

2025.04.01 2025.05.27

Literature Database

Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms

Authors: Shuoming Zhang, Jiacheng Zhao, Ruiyuan Xu, Xiaobing Feng, Huimin Cui | Published: 2025-03-31

LLM Security

Disabling Safety Mechanisms of LLM

Prompt Injection

2025.03.31 2025.05.27

Literature Database

Detecting Functional Bugs in Smart Contracts through LLM-Powered and Bug-Oriented Composite Analysis

Authors: Binbin Zhao, Xingshuang Lin, Yuan Tian, Saman Zonouz, Na Ruan, Jiliang Li, Raheem Beyah, Shouling Ji | Published: 2025-03-31

Indirect Prompt Injection

Smart Contract Audit

Prompt Injection

2025.03.31 2025.05.27

Literature Database

MiZero: The Shadowy Defender Against Text Style Infringements

Authors: Ziwei Zhang, Juan Wen, Wanli Peng, Zhengxian Wu, Yinghan Zhou, Yiming Xue | Published: 2025-03-30 | Updated: 2025-05-29

Prompt Injection

Intellectual Property Protection

Watermarking Technology

2025.03.30 2025.05.31

Literature Database

Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing

Authors: Johan Wahréus, Ahmed Hussain, Panos Papadimitratos | Published: 2025-03-27

System Development

Prompt Injection

Large Language Model

2025.03.27 2025.05.27

Literature Database

Defeating Prompt Injections by Design

Authors: Edoardo Debenedetti, Ilia Shumailov, Tianqi Fan, Jamie Hayes, Nicholas Carlini, Daniel Fabian, Christoph Kern, Chongyang Shi, Andreas Terzis, Florian Tramèr | Published: 2025-03-24

Indirect Prompt Injection

Prompt Injection

2025.03.24 2025.05.27

Literature Database

Large Language Models powered Network Attack Detection: Architecture, Opportunities and Case Study

Authors: Xinggong Zhang, Qingyang Li, Yunpeng Tan, Zongming Guo, Lei Zhang, Yong Cui | Published: 2025-03-24

Prompt Injection

Prompt leaking

Intrusion Detection System

2025.03.24 2025.05.27

Literature Database

Knowledge Transfer from LLMs to Provenance Analysis: A Semantic-Augmented Method for APT Detection

Authors: Fei Zuo, Junghwan Rhee, Yung Ryn Choe | Published: 2025-03-24

Cyber Threat Intelligence

Prompt Injection

Information Extraction

2025.03.24 2025.05.27

Literature Database

STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models

Authors: Xunguang Wang, Wenxuan Wang, Zhenlan Ji, Zongjie Li, Pingchuan Ma, Daoyuan Wu, Shuai Wang | Published: 2025-03-23

Prompt Injection

Malicious Prompt

Effectiveness Analysis of Defense Methods

2025.03.23 2025.05.27

Literature Database

BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models

Authors: Zenghui Yuan, Jiawen Shi, Pan Zhou, Neil Zhenqiang Gong, Lichao Sun | Published: 2025-03-20

Backdoor Attack

Prompt Injection

Large Language Model

2025.03.20 2025.05.27

Literature Database