Exploring and Developing a Pre-Model Safeguard with Draft Models Authors: Hongyu Cai, Arjun Arunasalam, Yiming Liang, Antonio Bianchi, Z. Berkay Celik | Published: 2026-05-19 LLM SecurityPrompt InjectionLarge Language Model 2026.05.19 2026.05.21 Literature Database
Quantum Machine Learning for Cyber-Physical Anomaly Detection in Unmanned Aerial Vehicles: A Leakage-Free Evaluation with Proxy-Audited Feature Sets Authors: Carlos A. Durán Paredes, Javier E. León Calderón, Nicolás Sánchez Perea, German Darío Díaz, Camilo Segura Quintero | Published: 2026-05-19 UAV個体識別Dataset evaluationQuantum Machine Learning 2026.05.19 2026.05.21 Literature Database
MetaBackdoor: Exploiting Positional Encoding as a Backdoor Attack Surface in LLMs Authors: Rui Wen, Mark Russinovich, Andrew Paverd, Jun Sakuma, Ahmed Salem | Published: 2026-05-14 LLM SecurityData LeakageBackdoor Detection 2026.05.14 2026.05.16 Literature Database
PickleFuzzer: A Case Study in Fuzzing for Discrepancies Between Python Pickle Implementations Authors: Justin Applegate, Andreas Kellas | Published: 2026-05-14 データオブジェクトData Protection MethodWatermark Design 2026.05.14 2026.05.16 Literature Database
Toward Securing AI Agents Like Operating Systems Authors: Lukas Pirch, Micha Horlboge, Patrick Großmann, Syeda Mahnur Asif, Klim Kireev, Thorsten Holz, Konrad Rieck | Published: 2026-05-14 LLM SecurityIndirect Prompt InjectionData Protection Method 2026.05.14 2026.05.16 Literature Database
EVA: Editing for Versatile Alignment against Jailbreaks Authors: Yi Wang, Hongye Qiu, Yue Xu, Sibei Yang, Zhan Qin, Minlie Huang, Wenjie Wang | Published: 2026-05-14 LLM SecurityModel DoS安全性に関連するマルチモーダルなアプローチ 2026.05.14 2026.05.16 Literature Database
Defenses at Odds: Measuring and Explaining Defense Conflicts in Large Language Models Authors: Xiangtao Meng, Wenyu Chen, Chuanchao Zang, Xinyu Gao, Jianing Wang, Li Wang, Zheng Li, Shanqing Guo | Published: 2026-05-14 Bias Detection in AI OutputData Protection MethodModel DoS 2026.05.14 2026.05.16 Literature Database
Exploiting LLM Agent Supply Chains via Payload-less Skills Authors: Xinyu Liu, Yukai Zhao, Xing Hu, Xin Xia | Published: 2026-05-14 LLM SecurityIndirect Prompt InjectionAttack Method 2026.05.14 2026.05.16 Literature Database
Watermarking Game-Playing Agents in Perfect-Information Extensive-Form Games Authors: Juho Kim, Fei Fang, Tuomas Sandholm | Published: 2026-05-14 Digital Watermarking for Generative AIBehavior Analysis MethodWatermark Design 2026.05.14 2026.05.16 Literature Database
Identifying AI Web Scrapers Using Canary Tokens Authors: Steven Seiden, Triss Ren, Caroline Zhang, Taein Kim, Enze Liu, Emily Wenger | Published: 2026-05-13 LLM SecurityData Extraction and AnalysisUser Behavior Analysis 2026.05.13 2026.05.15 Literature Database