Prompt Injection

No Free Lunch with Guardrails

Authors: Divyanshu Kumar, Nitin Aravind Birur, Tanay Baswa, Sahil Agarwal, Prashanth Harshangi | Published: 2025-04-01 | Updated: 2025-04-03

Prompt Injection

Model DoS

Information Security

2025.04.01

Literature Database

Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms

Authors: Shuoming Zhang, Jiacheng Zhao, Ruiyuan Xu, Xiaobing Feng, Huimin Cui | Published: 2025-03-31

LLM Security

Disabling LLM Safety Mechanisms

Prompt Injection

2025.03.31 2025.04.03

Literature Database

Detecting Functional Bugs in Smart Contracts through LLM-Powered and Bug-Oriented Composite Analysis

Authors: Binbin Zhao, Xingshuang Lin, Yuan Tian, Saman Zonouz, Na Ruan, Jiliang Li, Raheem Beyah, Shouling Ji | Published: 2025-03-31

Indirect Prompt Injection

Smart Contract Auditing

Prompt Injection

2025.03.31 2025.04.03

Literature Database

MiZero: The Shadowy Defender Against Text Style Infringements

Authors: Ziwei Zhang, Juan Wen, Wanli Peng, Zhengxian Wu, Yinghan Zhou, Yiming Xue | Published: 2025-03-30 | Updated: 2025-05-29

Prompt Injection

Intellectual Property Protection

Watermarking Techniques

2025.03.30

Literature Database

Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing

Authors: Johan Wahréus, Ahmed Hussain, Panos Papadimitratos | Published: 2025-03-27

System Development

Prompt Injection

Large Language Models

2025.03.27 2025.04.03

Literature Database

Defeating Prompt Injections by Design

Authors: Edoardo Debenedetti, Ilia Shumailov, Tianqi Fan, Jamie Hayes, Nicholas Carlini, Daniel Fabian, Christoph Kern, Chongyang Shi, Andreas Terzis, Florian Tramèr | Published: 2025-03-24

Indirect Prompt Injection

Prompt Injection

2025.03.24 2025.04.03

Literature Database

Large Language Models powered Network Attack Detection: Architecture, Opportunities and Case Study

Authors: Xinggong Zhang, Qingyang Li, Yunpeng Tan, Zongming Guo, Lei Zhang, Yong Cui | Published: 2025-03-24

Prompt Injection

Prompt Leaking

Intrusion Detection Systems

2025.03.24 2025.04.03

Literature Database

Knowledge Transfer from LLMs to Provenance Analysis: A Semantic-Augmented Method for APT Detection

Authors: Fei Zuo, Junghwan Rhee, Yung Ryn Choe | Published: 2025-03-24

Cyber Threat Intelligence

Prompt Injection

Information Extraction

2025.03.24 2025.04.03

Literature Database

STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models

Authors: Xunguang Wang, Wenxuan Wang, Zhenlan Ji, Zongjie Li, Pingchuan Ma, Daoyuan Wu, Shuai Wang | Published: 2025-03-23

Prompt Injection

Malicious Prompts

Effectiveness Analysis of Defense Methods

2025.03.23 2025.04.03

Literature Database

BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models

Authors: Zenghui Yuan, Jiawen Shi, Pan Zhou, Neil Zhenqiang Gong, Lichao Sun | Published: 2025-03-20

Backdoor Attacks

Prompt Injection

Large Language Models

2025.03.20 2025.04.03

Literature Database