Attack Detection Method

Immunity memory-based jailbreak detection: multi-agent adaptive guard for large language models

Authors: Jun Leng, Litian Zhang, Xi Zhang | Published: 2025-12-03

Prompt Injection

メモリ化メカニズム

Attack Detection Method

2025.12.03 2025.12.05

Literature Database

Embedding Poisoning: Bypassing Safety Alignment via Embedding Semantic Shift

Authors: Shuai Yuan, Zhibo Zhang, Yuxi Li, Guangdong Bai, Wang Kailong | Published: 2025-09-08

Disabling Safety Mechanisms of LLM

Calculation of Output Harmfulness

Attack Detection Method

2025.09.08 2025.09.10

Literature Database

Adversarial Suffix Filtering: a Defense Pipeline for LLMs

Authors: David Khachaturov, Robert Mullins | Published: 2025-05-14

Prompt validation

倫理基準遵守

Attack Detection Method

2025.05.14 2025.05.28

Literature Database

CANTXSec: A Deterministic Intrusion Detection and Prevention System for CAN Bus Monitoring ECU Activations

Authors: Denis Donadel, Kavya Balasubramanian, Alessandro Brighente, Bhaskar Ramasubramanian, Mauro Conti, Radha Poovendran | Published: 2025-05-14

Testbed

Attack Detection Method

電子制御ユニット

2025.05.14 2025.05.28

Literature Database

Evaluating the Robustness of Adversarial Defenses in Malware Detection Systems

Authors: Mostafa Jafari, Alireza Shameli-Sendi | Published: 2025-05-14

Robustness Analysis

Attack Detection Method

Adversarial Learning

2025.05.14 2025.05.28

Literature Database

Instantiating Standards: Enabling Standard-Driven Text TTP Extraction with Evolvable Memory

Authors: Cheng Meng, ZhengWei Jiang, QiuYun Wang, XinYi Li, ChunYan Ma, FangMing Dong, FangLi Ren, BaoXu Liu | Published: 2025-05-14

Prompt leaking

Attack Detection Method

Knowledge Extraction Method

2025.05.14 2025.05.28

Literature Database

I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference

Authors: Zibo Gao, Junjie Hu, Feng Guo, Yixin Zhang, Yinglong Han, Siyuan Liu, Haiyang Li, Zhiqiang Lv | Published: 2025-05-10 | Updated: 2025-05-14

Disabling Safety Mechanisms of LLM

Prompt leaking

Attack Detection Method

2025.05.10 2025.05.28

Literature Database

Les Dissonances: Cross-Tool Harvesting and Polluting in Multi-Tool Empowered LLM Agents

Authors: Zichuan Li, Jian Cui, Xiaojing Liao, Luyi Xing | Published: 2025-04-04 | Updated: 2025-04-28

Indirect Prompt Injection

Vulnerabilities of Tools

Attack Detection Method

2025.04.04 2025.05.27

Literature Database

From Sands to Mansions: Towards Automated Cyberattack Emulation with Classical Planning and Large Language Models

Authors: Lingzhi Wang, Zhenyuan Li, Yi Jiang, Zhengkai Wang, Zonghan Guo, Jiahui Wang, Yangyang Wei, Xiangmin Shen, Wei Ruan, Yan Chen | Published: 2024-07-24 | Updated: 2025-04-17

Prompt leaking

Attack Action Model

Attack Detection Method

2024.07.24 2025.05.27

Literature Database