AIセキュリティポータルbot

Hijacking Agent Memory: Stealthy Trojan Attacks Through Conversational Interaction

Authors: Hongtao Wang, Se Yang, Yu Chen, Puzhuo Liu | Published: 2026-05-28
LLM Security
Indirect Prompt Injection
メモリ効率化手法

Dissecting the Black Box: Circuit-Level Analysis of LLM Vulnerability Detection

Authors: Syafiq Al Atiiq, Chun Zhou, Christian Gehrmann | Published: 2026-05-28
Disabling Safety Mechanisms of LLM
Model Architecture
Interpretation Method

KBF: Knowledge Boundary as Fingerprint for Language Model and Black-Box API Auditing

Authors: Yijia Fang, Yiqing Feng, Bingyu Li, Mingxun Zhou | Published: 2026-05-28
RAG
Data Extraction and Analysis
Model Architecture

SciIntBench: Measuring LLM Compliance with Research Integrity Norms Under Adversarial Framing

Authors: Almene De Meran Meguimtsop, Maria Leonor Pacheco, Daniel E. Acuna | Published: 2026-05-28
Disabling Safety Mechanisms of LLM
Indirect Prompt Injection
Author Contribution

Protecting On-Device AI Inference: A Systematic Review of Attacks and Defence Mechanisms

Authors: Zisis Tsiatsikas, Alexandros Fakis, Georgios Karopoulos, Vasileios Kouliaridis, Marios Anagnostopoulos | Published: 2026-05-28
Data Protection Method
Backdoor Detection
Model Extraction Attack

Provably Secure Agent Guardrail

Authors: Benlong Wu, Weiming Zhang, Kejiang Chen, Han Fang, Nenghai Yu | Published: 2026-05-28
LLM Security
Data Protection Method
Efficient Proof System

Implicit Identity Technologies for LLMs: Fingerprinting and Watermarking across Datasets, Models, and Generated Content

Authors: Bing Liu, Shunping Wang, Yufan Zhu, Xinyi Yu, Jing Huang, Linkang Du, Hongbin Pei, Wei Luo | Published: 2026-05-28
Indirect Prompt Injection
Digital Watermarking for Generative AI
Author Identification Method

Evolving Skill-Structured Attack Memory Enhances LLM Jailbreaking

Authors: Junke Zhang, Jianwei Wang, Sishuo Chen, Yizhang He, Qingshuai Feng, Zhengyi Yang | Published: 2026-05-28
LLM Security
Prompt Injection
メモリ効率化手法

Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents

Authors: Aditya Nawal, Manit Baser, Mohan Gurusamy | Published: 2026-05-28
Relationship of AI Systems
Indirect Prompt Injection
Data Extraction and Analysis

SAMD: A Tool for Identifying False Data Injection Scenarios in AI/ML-enabled Medical Devices

Authors: Mohammadreza Hallajiyan, Xueren Ge, Athish Pranav Dharmalingam, Gargi Mitra, Shahrear Iqbal, Homa Alemzadeh, Karthik Pattabiraman | Published: 2026-05-28
LLM Security
シナリオベースの悪用
Data-Centric Security