SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts

Authors: Yuan Xin, Yixuan Weng, Minjun Zhu, Ying Ling, Chengwei Qin, Michael Hahn, Michael Backes, Yue Zhang, Linyi Yang | Published: 2026-04-29

Quantamination: Dynamic Quantization Leaks Your Data Across the Batch

Authors: Hanna Foerster, Ilia Shumailov, Cheng Zhang, Yiren Zhao, Jamie Hayes, Robert Mullins | Published: 2026-04-29

Towards Agentic Investigation of Security Alerts

Authors: Even Eilertsen, Vasileios Mavroeidis, Gudmund Grov | Published: 2026-04-28

From CRUD to Autonomous Agents: Formal Validation and Zero-Trust Security for Semantic Gateways in AI-Native Enterprise Systems

Authors: Ignacio Peyrano | Published: 2026-04-28

MARD: A Multi-Agent Framework for Robust Android Malware Detection

Authors: Xueying Zeng, Youquan Xian, Sihao Liu, Xudong Mou, Yanze Li, Lei Cui, Bo Li | Published: 2026-04-28

R-CoT: A Reasoning-Layer Watermark via Redundant Chain-of-Thought in Large Language Models

Authors: Ziming Zhang, Li Li, Guorui Feng, Hanzhou Wu, Xinpeng Zhang | Published: 2026-04-28

Making AI-Assisted Grant Evaluation Auditable without Exposing the Model

Authors: Kemal Bicakci | Published: 2026-04-28

AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents

Authors: Yixiang Zhang, Xinhao Deng, Jiaqing Wu, Yue Xiao, Ke Xu, Qi Li | Published: 2026-04-27

Layerwise Convergence Fingerprints for Runtime Misbehavior Detection in Large Language Models

Authors: Nay Myat Min, Long H. Pham, Jun Sun | Published: 2026-04-27

GAMMAF: A Common Framework for Graph-Based Anomaly Monitoring Benchmarking in LLM Multi-Agent Systems

Authors: Pablo Mateo-Torrejón, Alfonso Sánchez-Macián | Published: 2026-04-27