MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation

Authors: Weisen Jiang, Sinno Jialin Pan | Published: 2025-10-09

Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs

Authors: Man Hu, Xinyi Wu, Zuofeng Suo, Jinbo Feng, Linghui Meng, Yanhao Jia, Anh Tuan Luu, Shuai Zhao | Published: 2025-10-09

Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions

Authors: Yixiang Zhang, Xinhao Deng, Zhongyi Gu, Yihao Chen, Ke Xu, Qi Li, Jianping Wu | Published: 2025-10-08

RedTWIZ: Diverse LLM Red Teaming via Adaptive Attack Planning

Authors: Artur Horal, Daniel Pina, Henrique Paz, Iago Paulo, João Soares, Rafael Ferreira, Diogo Tavares, Diogo Glória-Silva, João Magalhães, David Semedo | Published: 2025-10-08

VelLMes: A high-interaction AI-based deception framework

Authors: Muris Sladić, Veronica Valeros, Carlos Catania, Sebastian Garcia | Published: 2025-10-08

Exposing Citation Vulnerabilities in Generative Engines

Authors: Riku Mochizuki, Shusuke Komatsu, Souta Noguchi, Kazuto Ataka | Published: 2025-10-08

Bionetta: Efficient Client-Side Zero-Knowledge Machine Learning Proving

Authors: Dmytro Zakharov, Oleksandr Kurbatov, Artem Sdobnov, Lev Soukhanov, Yevhenii Sekhin, Vitalii Volovyk, Mykhailo Velykodnyi, Mark Cherepovskyi, Kyrylo Baibula, Lasha Antadze, Pavlo Kravchenko, Volodymyr Dubinin, Yaroslav Panasenko | Published: 2025-10-08

Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG)

Authors: Junki Mori, Kazuya Kakizaki, Taiki Miyagawa, Jun Sakuma | Published: 2025-10-08

Distilling Lightweight Language Models for C/C++ Vulnerabilities

Authors: Zhiyuan Wei, Xiaoxuan Yang, Jing Sun, Zijian Zhang | Published: 2025-10-08

Code Agent can be an End-to-end System Hacker: Benchmarking Real-world Threats of Computer-use Agent

Authors: Weidi Luo, Qiming Zhang, Tianyu Lu, Xiaogeng Liu, Bin Hu, Hung-Chun Chiu, Siyuan Ma, Yizhe Zhang, Xusheng Xiao, Yinzhi Cao, Zhen Xiang, Chaowei Xiao | Published: 2025-10-08