インダイレクトプロンプトインジェクション

Taming OpenClaw: Security Analysis and Mitigation of Autonomous LLM Agent Threats

Authors: Xinhao Deng, Yixiang Zhang, Jiaqing Wu, Jiaqi Bai, Sibo Yi, Zhuoheng Zou, Yue Xiao, Rennai Qiu, Jianan Ma, Jialuo Chen, Xiaohu Du, Xiaofang Yang, Shiwen Cui, Changhua Meng, Weiqiang Wang, Jiaxing Song, Ke Xu, Qi Li | Published: 2026-03-12

プロンプトインジェクション

脆弱性管理

2026.03.12

文献データベース

Don’t Let the Claw Grip Your Hand: A Security Analysis and Defense Framework for OpenClaw

Authors: Zhengyang Shan, Jiayun Xin, Yue Zhang, Minghui Xu | Published: 2026-03-11

インダイレクトプロンプトインジェクション

プロンプトインジェクション

安全性分析

2026.03.11

文献データベース

CyberThreat-Eval: Can Large Language Models Automate Real-World Threat Research?

Authors: Xiangsen Chen, Xuan Feng, Shuo Chen, Matthieu Maitre, Sudipto Rakshit, Diana Duvieilh, Ashley Picone, Nan Tang | Published: 2026-03-10

LLMの安全機構の解除

LLM性能評価

インダイレクトプロンプトインジェクション

2026.03.10

文献データベース

Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial Instructions

Authors: Neha Nagaraja, Lan Zhang, Zhilong Wang, Bo Zhang, Pawan Patil | Published: 2026-03-04

インダイレクトプロンプトインジェクション

プロンプト埋め込み手法

視覚的手法

2026.03.04

文献データベース

ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense

Authors: Nancy Lau, Louis Sloot, Jyoutir Raj, Giuseppe Marco Boscardin, Evan Harris, Dylan Bowman, Mario Brajkovski, Jaideep Chawla, Dan Zhao | Published: 2026-03-02

LLM性能評価

インダイレクトプロンプトインジェクション

脆弱性評価手法

2026.03.02

文献データベース

DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern

Authors: Xiaoyi Pang, Xuanyi Hao, Pengyu Liu, Qi Luo, Song Guo, Zhibo Wang | Published: 2026-03-02

LLM性能評価

インダイレクトプロンプトインジェクション

プロンプトインジェクション

2026.03.02

文献データベース

From Secure Agentic AI to Secure Agentic Web: Challenges, Threats, and Future Directions

Authors: Zhihang Deng, Jiaping Gui, Weinan Zhang | Published: 2026-03-02

インダイレクトプロンプトインジェクション

安全性評価

脅威モデル

2026.03.02

文献データベース

Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision

Authors: Manisha Mukherjee, Vincent J. Hellendoorn | Published: 2026-03-02

インダイレクトプロンプトインジェクション

セキュリティに関連する知識を活用した手法

プロンプトリーキング

2026.03.02

文献データベース

AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification

Authors: Tian Zhang, Yiwei Xu, Juan Wang, Keyan Guo, Xiaoyang Xu, Bowen Xiao, Quanlong Guan, Jinlin Fan, Jiawei Liu, Zhiquan Liu, Hongxin Hu | Published: 2026-02-26

インダイレクトプロンプトインジェクション

カウンターファクチュアル説明

データ管理システム

2026.02.26

文献データベース

The LLMbda Calculus: AI Agents, Conversations, and Information Flow

Authors: Zac Garby, Andrew D. Gordon, David Sands | Published: 2026-02-23

インダイレクトプロンプトインジェクション

セキュリティ分析手法

データ流分析

2026.02.23

文献データベース