Red-Teaming the Agentic Red-Team

Authors: Dario Pasquini, Michal Bazyli, Taras Fedynyshyn, Artem Sorokin | Published: 2026-06-23

PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models

Authors: Simone Gallivanone, Hossein Khodadadi, Mauro Dore, Mauro Medda, Nicola Franco | Published: 2026-06-23

AutoSpec: Safety Rule Evolution for LLM Agents via Inductive Logic Programming

Authors: Pingchuan Ma, Zhaoyu Wang, Zimo Ji, Yuguang Zhou, Zhantong Xue, Zongjie Li, Shuai Wang, Xiaoqin Zhang | Published: 2026-06-23

PixJail: Self-Evolving Paper-to-Pipeline Reproduction for Text-to-Image Jailbreak Evaluation

Authors: Leyi Sheng, Han Sun, Zhen Sun, Yuntao Yue, Jinlin Wu, Xinlei He, Jiaheng Wei | Published: 2026-06-23

An Automated Framework for Input Alphabet Construction in Stateful Protocol Implementation Learning

Authors: JiongHan Wang, WenChao Huang | Published: 2026-06-22

Detecting Malicious Agent Skills in the Wild using Attention

Authors: Bacem Etteib, Daniele Lunghi, Tégawendé F. Bissyandé | Published: 2026-06-22

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

Authors: Yinpeng Wu, Yitong Chen, Lixiang Wang, Jinyu Gu, Zhichao Hua, Yubin Xia | Published: 2026-06-22

Rethinking Molecular Graph Backdoors under Chemistry-aware Admission

Authors: Thinh T. H. Nguyen, Sze Jue Yang, Khoa D. Doan, Chee Seng Chan, Kok-Seng Wong | Published: 2026-06-22

GIF: Locally Sound Geometric Information Flow Control for LLMs

Authors: Adam Storek, Nikolaus Holzer, Zhuo Zhang, Suman Jana | Published: 2026-06-22

Exposing the Illusion of Erasure in Knowledge Editing for LLMs

Authors: Advik Raj Basani, Anshuman Chhabra | Published: 2026-06-22