Prompt Injection

FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning

Authors: Khurram Khalil, Khaza Anuarul Hoque | Published: 2025-12-10
Prompt Injection
Large Language Model
Vulnerability Assessment Method

Chasing Shadows: Pitfalls in LLM Security Research

Authors: Jonathan Evertz, Niklas Risse, Nicolai Neuer, Andreas Müller, Philipp Normann, Gaetano Sapia, Srishti Gupta, David Pape, Soumya Shaw, Devansh Srivastav, Christian Wressnegger, Erwin Quiring, Thorsten Eisenhofer, Daniel Arp, Lea Schönherr | Published: 2025-12-10
Prompt Injection
Prompt leaking

Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework

Authors: Sadegh Momeni, Ge Zhang, Birkett Huber, Hamza Harkous, Sam Lipton, Benoit Seguin, Yanis Pavlidis | Published: 2025-12-09
Cybersecurity
Safety of Data Generation
Prompt Injection

ThinkTrap: Denial-of-Service Attacks against Black-box LLM Services via Infinite Thinking

Authors: Yunzhe Li, Jianan Wang, Hongzi Zhu, James Lin, Shan Chang, Minyi Guo | Published: 2025-12-08
DoS Mitigation
Prompt Injection
Model DoS

SoK: a Comprehensive Causality Analysis Framework for Large Language Model Security

Authors: Wei Zhao, Zhe Li, Jun Sun | Published: 2025-12-04
Prompt Injection
因果推論
Large Language Model

In-Context Representation Hijacking

Authors: Itay Yona, Amir Sarid, Michael Karasik, Yossi Gandelsman | Published: 2025-12-03
Cybersecurity
Prompt Injection
Prompt leaking

HarnessAgent: Scaling Automatic Fuzzing Harness Construction with Tool-Augmented LLM Pipelines

Authors: Kang Yang, Yunhang Zhang, Zichuan Li, GuanHong Tao, Jun Xu, XiaoJing Liao | Published: 2025-12-03
Prompt Injection
Model DoS
自動化ペネトレーションテスト

Immunity memory-based jailbreak detection: multi-agent adaptive guard for large language models

Authors: Jun Leng, Litian Zhang, Xi Zhang | Published: 2025-12-03
Prompt Injection
メモリ化メカニズム
Attack Detection Method

Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities

Authors: Yuan Xiong, Ziqi Miao, Lijun Li, Chen Qian, Jie Li, Jing Shao | Published: 2025-12-02
Prompt Injection
Model DoS
Image Generation Technology

CryptoQA: A Large-scale Question-answering Dataset for AI-assisted Cryptography

Authors: Mayar Elfares, Pascal Reisert, Tilman Dietz, Manpa Barman, Ahmed Zaki, Ralf Küsters, Andreas Bulling | Published: 2025-12-02
Dataset Generation
Prompt Injection
Prompt leaking