Prompt leaking

Securing the AI Supply Chain: What Can We Learn From Developer-Reported Security Issues and Solutions of AI Projects?

Authors: The Anh Nguyen, Triet Huynh Minh Le, M. Ali Babar | Published: 2025-12-29
Security Analysis Method
Data-Driven Vulnerability Assessment
Prompt leaking

GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs

Authors: Lichao Wu, Sasha Behrouzi, Mohamadreza Rostami, Stjepan Picek, Ahmad-Reza Sadeghi | Published: 2025-12-24
Sparse Model
Prompt leaking
安全性に関連するマルチモーダルなアプローチ

ChatGPT: Excellent Paper! Accept It. Editor: Imposter Found! Review Rejected

Authors: Kanchon Gharami, Sanjiv Kumar Sarkar, Yongxin Liu, Shafika Showkat Moni | Published: 2025-12-23
Prompt leaking
Model Extraction Attack
Adversarial Attack Assessment

From Retrieval to Reasoning: A Framework for Cyber Threat Intelligence NER with Explicit and Adaptive Instructions

Authors: Jiaren Peng, Hongda Sun, Xuan Tian, Cheng Huang, Zeqing Li, Rui Yan | Published: 2025-12-22
RAG
Data Selection Strategy
Prompt leaking

Prefix Probing: Lightweight Harmful Content Detection for Large Language Models

Authors: Jirui Yang, Hengqi Guo, Zhihui Lu, Yi Zhao, Yuansen Zhang, Shijing Hu, Qiang Duan, Yinggui Wang, Tao Wei | Published: 2025-12-18
Token Distribution Analysis
Prompt Injection
Prompt leaking

In-Context Probing for Membership Inference in Fine-Tuned Language Models

Authors: Zhexi Lu, Hongliang Chi, Nathalie Baracaldo, Swanand Ravindra Kadhe, Yuseok Jeon, Lei Yu | Published: 2025-12-18
Bias Detection in AI Output
Privacy-Preserving Machine Learning
Prompt leaking

ContextLeak: Auditing Leakage in Private In-Context Learning Methods

Authors: Jacob Choi, Shuying Cao, Xingjian Dong, Wang Bill Zhu, Robin Jia, Sai Praneeth Karimireddy | Published: 2025-12-18
Data Leakage
Privacy-Preserving Machine Learning
Prompt leaking

PerProb: Indirectly Evaluating Memorization in Large Language Models

Authors: Yihan Liao, Jacky Keung, Xiaoxue Ma, Jingyu Zhang, Yicheng Sun | Published: 2025-12-16
Indirect Prompt Injection
Privacy protection framework
Prompt leaking

On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Models

Authors: Ali Al Sahili, Ali Chehab, Razane Tajeddine | Published: 2025-12-15
Data Extraction and Analysis
Prompt leaking
評価メトリクス

CTIGuardian: A Few-Shot Framework for Mitigating Privacy Leakage in Fine-Tuned LLMs

Authors: Shashie Dilhara Batan Arachchige, Benjamin Zi Hao Zhao, Hassan Jameel Asghar, Dinusha Vatsalan, Dali Kaafar | Published: 2025-12-15
Trade-off Analysis
Privacy Protection Method
Prompt leaking