プロンプトインジェクション

GShield: Mitigating Poisoning Attacks in Federated Learning

Authors: Sameera K. M., Serena Nicolazzo, Antonino Nocera, Vinod P., Rafidha Rehiman K. A | Published: 2025-12-22

データ毒性攻撃

プロンプトインジェクション

ポイズニング

2025.12.22

文献データベース

Efficient Jailbreak Mitigation Using Semantic Linear Classification in a Multi-Staged Pipeline

Authors: Akshaj Prashanth Rao, Advait Singh, Saumya Kumaar Saksena, Dhruv Kumar | Published: 2025-12-22

プロンプトインジェクション

透かし

防御メカニズム

2025.12.22

文献データベース

Prefix Probing: Lightweight Harmful Content Detection for Large Language Models

Authors: Jirui Yang, Hengqi Guo, Zhihui Lu, Yi Zhao, Yuansen Zhang, Shijing Hu, Qiang Duan, Yinggui Wang, Tao Wei | Published: 2025-12-18

トークン分布分析

プロンプトインジェクション

プロンプトリーキング

2025.12.18

文献データベース

A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection

Authors: Xiao Li, Yue Li, Hao Wu, Yue Zhang, Yechao Zhang, Fengyuan Xu, Sheng Zhong | Published: 2025-12-18

インダイレクトプロンプトインジェクション

プロンプトインジェクション

難読化手法

2025.12.18

文献データベース

Quantifying Return on Security Controls in LLM Systems

Authors: Richard Helder Moulton, Austin O'Brien, John D. Hastings | Published: 2025-12-17

プロンプトインジェクション

リスク分析手法

脆弱性検出手法

2025.12.17

文献データベース

PentestEval: Benchmarking LLM-based Penetration Testing with Modular and Stage-Level Design

Authors: Ruozhao Yang, Mingfei Cheng, Gelei Deng, Tianwei Zhang, Junjie Wang, Xiaofei Xie | Published: 2025-12-16

インダイレクトプロンプトインジェクション

プロンプトインジェクション

脆弱性管理

2025.12.16

文献データベース

FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning

Authors: Khurram Khalil, Khaza Anuarul Hoque | Published: 2025-12-10

プロンプトインジェクション

大規模言語モデル

脆弱性評価手法

2025.12.10

文献データベース

Chasing Shadows: Pitfalls in LLM Security Research

Authors: Jonathan Evertz, Niklas Risse, Nicolai Neuer, Andreas Müller, Philipp Normann, Gaetano Sapia, Srishti Gupta, David Pape, Soumya Shaw, Devansh Srivastav, Christian Wressnegger, Erwin Quiring, Thorsten Eisenhofer, Daniel Arp, Lea Schönherr | Published: 2025-12-10

データリークやモデルの問題に関する分析を反映した新規ラベル

プロンプトインジェクション

プロンプトリーキング

2025.12.10

文献データベース

Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework

Authors: Sadegh Momeni, Ge Zhang, Birkett Huber, Hamza Harkous, Sam Lipton, Benoit Seguin, Yanis Pavlidis | Published: 2025-12-09

サイバーセキュリティ

データ生成の安全性

プロンプトインジェクション

2025.12.09

文献データベース

ThinkTrap: Denial-of-Service Attacks against Black-box LLM Services via Infinite Thinking

Authors: Yunzhe Li, Jianan Wang, Hongzi Zhu, James Lin, Shan Chang, Minyi Guo | Published: 2025-12-08

DoS対策

プロンプトインジェクション

モデルDoS

2025.12.08

文献データベース