Untargeted Jailbreak Attack | Authors: Xinzhe Huang, Wenjing Hu, Tianhang Zheng, Kedong Xiu, Xiaojun Jia, Di Wang, Zhan Qin, Kui Ren | Published: 2025-10-03 | Updated: 2025-10-28 | Tags: Prompt Injection, Prompt leaking, Effectiveness Analysis of Defense Methods | 2025.10.03 2025.10.30 | Literature Database
Fine-Tuning Jailbreaks under Highly Constrained Black-Box Settings: A Three-Pronged Approach | Authors: Xiangfang Li, Yu Wang, Bo Li | Published: 2025-10-01 | Updated: 2025-10-09 | Tags: Indirect Prompt Injection, Prompt leaking, Defense Mechanism | 2025.10.01 2025.10.11 | Literature Database
MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction | Authors: Sepideh Abedini, Shubhankar Mohapatra, D. B. Emerson, Masoumeh Shafieinejad, Jesse C. Cresswell, Xi He | Published: 2025-09-27 | Updated: 2025-09-30 | Tags: SQL Query Generation, Prompt Injection, Prompt leaking | 2025.09.27 2025.10.02 | Literature Database
Enterprise AI Must Enforce Participant-Aware Access Control | Authors: Shashank Shreedhar Bhatt, Tanmay Rajore, Khushboo Aggarwal, Ganesh Ananthanarayanan, Ranveer Chandra, Nishanth Chandran, Suyash Choudhury, Divya Gupta, Emre Kiciman, Sumit Kumar Pandey, Srinath Setty, Rahul Sharma, Teijia Zhao | Published: 2025-09-18 | Tags: Security Analysis, Privacy Management, Prompt leaking | 2025.09.18 2025.09.20 | Literature Database
Yet Another Watermark for Large Language Models | Authors: Siyuan Bao, Ying Shi, Zhiguang Yang, Hanzhou Wu, Xinpeng Zhang | Published: 2025-09-16 | Tags: Prompt leaking, Large Language Model, Watermarking Technology | 2025.09.16 2025.09.18 | Literature Database
PromptCOS: Towards System Prompt Copyright Auditing for LLMs via Content-level Output Similarity | Authors: Yuchen Yang, Yiming Li, Hongwei Yao, Enhao Huang, Shuo Shao, Bingrun Yang, Zhibo Wang, Dacheng Tao, Zhan Qin | Published: 2025-09-03 | Tags: Prompt validation, Prompt leaking, Model Extraction Attack | 2025.09.03 2025.09.05 | Literature Database
The Double-edged Sword of LLM-based Data Reconstruction: Understanding and Mitigating Contextual Vulnerability in Word-level Differential Privacy Text Sanitization | Authors: Stephen Meisenbacher, Alexandra Klymenko, Andreea-Elena Bodea, Florian Matthes | Published: 2025-08-26 | Tags: Prompt leaking, Differential Privacy, Document Privacy | 2025.08.26 2025.08.28 | Literature Database
Membership Inference Attacks on LLM-based Recommender Systems | Authors: Jiajie He, Yuechun Gu, Min-Chun Chen, Keke Chen | Published: 2025-08-26 | Tags: Privacy Design Principles, Prompt leaking, Membership Inference | 2025.08.26 2025.08.28 | Literature Database
Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models | Authors: Guangyu Yang, Jinghong Chen, Jingbiao Mei, Weizhe Lin, Bill Byrne | Published: 2025-08-22 | Updated: 2025-11-03 | Tags: Prompt Injection, Prompt leaking, Threat modeling | 2025.08.22 2025.11.05 | Literature Database
MCPSecBench: A Systematic Security Benchmark and Playground for Testing Model Context Protocols | Authors: Yixuan Yang, Daoyuan Wu, Yufan Chen | Published: 2025-08-17 | Updated: 2025-10-09 | Tags: Prompt leaking, Large Language Model, Defense Mechanism | 2025.08.17 2025.10.11 | Literature Database