AIセキュリティポータルbot

SoK: Unlearnability and Unlearning for Model Dememorization

Authors: Mengying Zhang, Derui Wang, Ruoxi Sun, Xiaoyu Xia, Shuang Hao, Minhui Xue | Published: 2026-05-12
Data Protection Method
Certified Robustness
Model Protection Methods

FlowSteer: Prompt-Only Workflow Steering Exposes Planning-Time Vulnerabilities in Multi-Agent LLM Systems

Authors: Fanxiao Li, Jiaying Wu, Tingchao Fu, Natasha Jaques, Wei Zhou, Min-Yen Kan | Published: 2026-05-12
Indirect Prompt Injection
Data-Centric Security
多エージェントシステムの評価

CTFusion: A CTF-based Benchmark for LLM Agent Evaluation

Authors: Dongjun Lee, Ga-eun Bae, Insu Yun | Published: 2026-05-12
CTF競技
Reliability Assessment
Deception Detection

Can a Single Message Paralyze the AI Infrastructure? The Rise of AbO-DDoS Attacks through Targeted Mobius Injection

Authors: Zi Liang, Ronghua Li, Yanyun Wang, Qingqing Ye, Haibo Hu | Published: 2026-05-12
Indirect Prompt Injection
User Authentication System
Taxonomy of Attacks

Threat Modelling using Domain-Adapted Language Models: Empirical Evaluation and Insights

Authors: Saba Pourhanifeh, AbdulAziz AbdulGhaffar, Ashraf Matrawy | Published: 2026-05-11
Prompt Injection
Prompt leaking
Taxonomy of Attacks

LLMs for Secure Hardware Design and Related Problems: Opportunities and Challenges

Authors: Johann Knechtel, Ozgur Sinanoglu, Ramesh Karri | Published: 2026-05-11
Prompt Injection
Vulnerability Analysis
Design Optimization Methods

Re-Triggering Safeguards within LLMs for Jailbreak Detection

Authors: Zheng Lin, Zhenxing Niu, Haoxuan Ji, Yuzhe Huang, Haichang Gao | Published: 2026-05-11
Prompt Injection
Model Robustness
Large Language Model

Guaranteed Jailbreaking Defense via Disrupt-and-Rectify Smoothing

Authors: Zheng Lin, Zhenxing Niu, Haoxuan Ji, Haichang Gao | Published: 2026-05-11
Disabling Safety Mechanisms of LLM
Prompt Injection
Model Robustness

When Prompts Become Payloads: A Framework for Mitigating SQL Injection Attacks in Large Language Model-Driven Applications

Authors: Farzad Nourmohammadzadeh Motlagh, Mehrdad Hajizadeh, Mehryar Majd, Pejman Najafi, Feng Cheng, Christoph Meinel | Published: 2026-05-11
Indirect Prompt Injection
Prompt validation
Vulnerability Analysis

Benchmarking Safety Risks of Knowledge-Intensive Reasoning under Malicious Knowledge Editing

Authors: Qinghua Mao, Xi Lin, Jinze Gu, Jun Wu, Siyuan Li, Yuliang Chen | Published: 2026-05-11
Prompt leaking
Risk Analysis Method
Knowledge Embedding Algorithm