From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses Authors: Xiangtao Meng, Tianshuo Cong, Li Wang, Wenyu Chen, Zheng Li, Shanqing Guo, Xiaoyun Wang | Published: 2025-10-09 2025.10.09 文献データベース
MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation Authors: Weisen Jiang, Sinno Jialin Pan | Published: 2025-10-09 2025.10.09 文献データベース
Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs Authors: Man Hu, Xinyi Wu, Zuofeng Suo, Jinbo Feng, Linghui Meng, Yanhao Jia, Anh Tuan Luu, Shuai Zhao | Published: 2025-10-09 2025.10.09 文献データベース
Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions Authors: Yixiang Zhang, Xinhao Deng, Zhongyi Gu, Yihao Chen, Ke Xu, Qi Li, Jianping Wu | Published: 2025-10-08 2025.10.08 文献データベース
RedTWIZ: Diverse LLM Red Teaming via Adaptive Attack Planning Authors: Artur Horal, Daniel Pina, Henrique Paz, Iago Paulo, João Soares, Rafael Ferreira, Diogo Tavares, Diogo Glória-Silva, João Magalhães, David Semedo | Published: 2025-10-08 2025.10.08 文献データベース
VelLMes: A high-interaction AI-based deception framework Authors: Muris Sladić, Veronica Valeros, Carlos Catania, Sebastian Garcia | Published: 2025-10-08 2025.10.08 文献データベース
Exposing Citation Vulnerabilities in Generative Engines Authors: Riku Mochizuki, Shusuke Komatsu, Souta Noguchi, Kazuto Ataka | Published: 2025-10-08 2025.10.08 文献データベース
Bionetta: Efficient Client-Side Zero-Knowledge Machine Learning Proving Authors: Dmytro Zakharov, Oleksandr Kurbatov, Artem Sdobnov, Lev Soukhanov, Yevhenii Sekhin, Vitalii Volovyk, Mykhailo Velykodnyi, Mark Cherepovskyi, Kyrylo Baibula, Lasha Antadze, Pavlo Kravchenko, Volodymyr Dubinin, Yaroslav Panasenko | Published: 2025-10-08 2025.10.08 文献データベース
Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG) Authors: Junki Mori, Kazuya Kakizaki, Taiki Miyagawa, Jun Sakuma | Published: 2025-10-08 2025.10.08 文献データベース
Distilling Lightweight Language Models for C/C++ Vulnerabilities Authors: Zhiyuan Wei, Xiaoxuan Yang, Jing Sun, Zijian Zhang | Published: 2025-10-08 2025.10.08 文献データベース