SciIntBench: Measuring LLM Compliance with Research Integrity Norms Under Adversarial Framing Authors: Almene De Meran Meguimtsop, Maria Leonor Pacheco, Daniel E. Acuna | Published: 2026-05-28 2026.05.28 2026.05.30 Literature Database
Protecting On-Device AI Inference: A Systematic Review of Attacks and Defence Mechanisms Authors: Zisis Tsiatsikas, Alexandros Fakis, Georgios Karopoulos, Vasileios Kouliaridis, Marios Anagnostopoulos | Published: 2026-05-28 2026.05.28 2026.05.30 Literature Database
Provably Secure Agent Guardrail Authors: Benlong Wu, Weiming Zhang, Kejiang Chen, Han Fang, Nenghai Yu | Published: 2026-05-28 2026.05.28 2026.05.30 Literature Database
Implicit Identity Technologies for LLMs: Fingerprinting and Watermarking across Datasets, Models, and Generated Content Authors: Bing Liu, Shunping Wang, Yufan Zhu, Xinyi Yu, Jing Huang, Linkang Du, Hongbin Pei, Wei Luo | Published: 2026-05-28 2026.05.28 2026.05.30 Literature Database
Evolving Skill-Structured Attack Memory Enhances LLM Jailbreaking Authors: Junke Zhang, Jianwei Wang, Sishuo Chen, Yizhang He, Qingshuai Feng, Zhengyi Yang | Published: 2026-05-28 2026.05.28 2026.05.30 Literature Database
Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents Authors: Aditya Nawal, Manit Baser, Mohan Gurusamy | Published: 2026-05-28 2026.05.28 2026.05.30 Literature Database
SAMD: A Tool for Identifying False Data Injection Scenarios in AI/ML-enabled Medical Devices Authors: Mohammadreza Hallajiyan, Xueren Ge, Athish Pranav Dharmalingam, Gargi Mitra, Shahrear Iqbal, Homa Alemzadeh, Karthik Pattabiraman | Published: 2026-05-28 2026.05.28 2026.05.30 Literature Database
Blind PRNG Hijacking: An Undetectable Integrity-Preserving Attack Against LLM Watermarking Authors: Ziyang You, Huilong He, Xiaoke Yang, Xuxing Lu | Published: 2026-05-27 2026.05.27 2026.05.29 Literature Database
Towards Cybersecurity SuperIntelligence (CSI): What’s the best harness for cybersecurity? Authors: Víctor Mayoral-Vilches, Francesco Balassone, María Sanz-Gómez, Paul Zabalegui Landa, Daniel Sánchez Prieto, Marina Oteiza Álvarez, Davide Quarta, Martin Pinzger | Published: 2026-05-27 2026.05.27 2026.05.29 Literature Database
SPARD: Defending Harmful Fine-Tuning Attack via Safety Projection with Relevance-Diversity Data Selection Authors: Shuhao Chen, Weisen Jiang, Yeqi Gong, Shengda Luo, Chengxiang Zhuo, Zang Li, James T. Kwok, Yu Zhang | Published: 2026-05-27 2026.05.27 2026.05.29 Literature Database