DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern Authors: Xiaoyi Pang, Xuanyi Hao, Pengyu Liu, Qi Luo, Song Guo, Zhibo Wang | Published: 2026-03-02 2026.03.02 2026.03.04 Literature Database
From Secure Agentic AI to Secure Agentic Web: Challenges, Threats, and Future Directions Authors: Zhihang Deng, Jiaping Gui, Weinan Zhang | Published: 2026-03-02 2026.03.02 2026.03.04 Literature Database
Towards Privacy-Preserving LLM Inference via Collaborative Obfuscation (Technical Report) Authors: Yu Lin, Qizhi Zhang, Wenqiang Ruan, Daode Zhang, Jue Hong, Ye Wu, Hanning Xia, Yunlong Mao, Sheng Zhong | Published: 2026-03-02 2026.03.02 2026.03.04 Literature Database
Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision Authors: Manisha Mukherjee, Vincent J. Hellendoorn | Published: 2026-03-02 2026.03.02 2026.03.04 Literature Database
LLM Novice Uplift on Dual-Use, In Silico Biology Tasks Authors: Chen Bo Calvin Zhang, Christina Q. Knight, Nicholas Kruus, Jason Hausenloy, Pedro Medeiros, Nathaniel Li, Aiden Kim, Yury Orlovskiy, Coleman Breen, Bryce Cai, Jasper Götting, Andrew Bo Liu, Samira Nedungadi, Paula Rodriguez, Yannis Yiming He, Mohamed Shaaban, Zifan Wang, Seth Donoughe, Julian Michael | Published: 2026-02-26 2026.02.26 2026.02.28 Literature Database
A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring Authors: Usman Anwar, Julianna Piskorz, David D. Baek, David Africa, Jim Weatherall, Max Tegmark, Christian Schroeder de Witt, Mihaela van der Schaar, David Krueger | Published: 2026-02-26 2026.02.26 2026.02.28 Literature Database
Assessing Deanonymization Risks with Stylometry-Assisted LLM Agent Authors: Boyang Zhang, Yang Zhang | Published: 2026-02-26 2026.02.26 2026.02.28 Literature Database
Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search Authors: Xun Huang, Simeng Qin, Xiaoshuang Jia, Ranjie Duan, Huanqian Yan, Zhitao Zeng, Fei Yang, Yang Liu, Xiaojun Jia | Published: 2026-02-26 2026.02.26 2026.02.28 Literature Database
AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification Authors: Tian Zhang, Yiwei Xu, Juan Wang, Keyan Guo, Xiaoyang Xu, Bowen Xiao, Quanlong Guan, Jinlin Fan, Jiawei Liu, Zhiquan Liu, Hongxin Hu | Published: 2026-02-26 2026.02.26 2026.02.28 Literature Database
IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation Authors: Yanpei Guo, Wenjie Qu, Linyu Wu, Shengfang Zhai, Lionel Z. Wang, Ming Xu, Yue Liu, Binhang Yuan, Dawn Song, Jiaheng Zhang | Published: 2026-02-26 2026.02.26 2026.02.28 Literature Database