Literature Database

Model-Agnostic Lifelong LLM Safety via Externalized Attack-Defense Co-Evolution

Authors: Xiaozhe Zhang, Chaozhuo Li, Hui Liu, Shaocheng Yan, Bingyu Yan, Qiwei Ye, Haoliang Li | Published: 2026-05-13
Disabling Safety Mechanisms of LLM
Alignment
Behavior Analysis Method

Empowering IoT Security: On-Device Intrusion Detection in Resource Constrained Devices

Authors: Vasilis Ieropoulos, Eirini Anthi, Theodoros Spyridopoulos, Pete Burnap, Aftab Khan, Pietro Carnelli | Published: 2026-05-13
IoT Cybersecurity
Data Protection Method
Machine Learning Application

Quantifying LLM Safety Degradation Under Repeated Attacks Using Survival Analysis

Authors: Zvi Topol | Published: 2026-05-13
LLM Security
Prompt Injection
Behavior Analysis Method

Persona-Model Collapse in Emergent Misalignment

Authors: Davi Bastos Costa, Renato Vicente | Published: 2026-05-13
Dataset evaluation
User Behavior Analysis
Behavior Analysis Method

HE-PIM: Demystifying Homomorphic Operations on a Real-world Processing-in-Memory System

Authors: Harshita Gupta, Mayank Kabra, Jaewoo Park, Priyam Mehta, Phillip Widdowson, Tathagata Barik, Nisa Bostancı, Konstantinos Kanellopoulos, Juan Gómez-Luna, Antonio J. Peña, Mohammad Sadrosadati, Onur Mutlu | Published: 2026-05-13
Efficiency Evaluation
Computational Complexity
Watermark Design

SoK: Unlearnability and Unlearning for Model Dememorization

Authors: Mengying Zhang, Derui Wang, Ruoxi Sun, Xiaoyu Xia, Shuang Hao, Minhui Xue | Published: 2026-05-12
Data Protection Method
Certified Robustness
Model Protection Methods

FlowSteer: Prompt-Only Workflow Steering Exposes Planning-Time Vulnerabilities in Multi-Agent LLM Systems

Authors: Fanxiao Li, Jiaying Wu, Tingchao Fu, Natasha Jaques, Wei Zhou, Min-Yen Kan | Published: 2026-05-12
Indirect Prompt Injection
Data-Centric Security
多エージェントシステムの評価

CTFusion: A CTF-based Benchmark for LLM Agent Evaluation

Authors: Dongjun Lee, Ga-eun Bae, Insu Yun | Published: 2026-05-12
CTF競技
Reliability Assessment
Deception Detection

Can a Single Message Paralyze the AI Infrastructure? The Rise of AbO-DDoS Attacks through Targeted Mobius Injection

Authors: Zi Liang, Ronghua Li, Yanyun Wang, Qingqing Ye, Haibo Hu | Published: 2026-05-12
Indirect Prompt Injection
User Authentication System
Taxonomy of Attacks

Threat Modelling using Domain-Adapted Language Models: Empirical Evaluation and Insights

Authors: Saba Pourhanifeh, AbdulAziz AbdulGhaffar, Ashraf Matrawy | Published: 2026-05-11
Prompt Injection
Prompt leaking
Taxonomy of Attacks