プロンプトインジェクション

SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism

Authors: Beitao Chen, Xinyu Lyu, Lianli Gao, Jingkuan Song, Heng Tao Shen | Published: 2025-07-02

プロンプトインジェクション

脱獄攻撃手法

透明性と検証

2025.07.02

文献データベース

Are AI-Generated Fixes Secure? Analyzing LLM and Agent Patches on SWE-bench

Authors: Amirali Sajadi, Kostadin Damevski, Preetha Chatterjee | Published: 2025-06-30 | Updated: 2025-07-24

ソフトウェアセキュリティ

プロンプトインジェクション

大規模言語モデル

2025.06.30

文献データベース

RawMal-TF: Raw Malware Dataset Labeled by Type and Family

Authors: David Bálik, Martin Jureček, Mark Stamp | Published: 2025-06-30

バックドアモデルの検知

プロンプトインジェクション

マルウェア分類のためのデータセット

2025.06.30

文献データベース

VERA: Variational Inference Framework for Jailbreaking Large Language Models

Authors: Anamika Lochab, Lu Yan, Patrick Pynadath, Xiangyu Zhang, Ruqi Zhang | Published: 2025-06-27 | Updated: 2025-11-06

プロンプトインジェクション

プロンプトリーキング

生成モデルの課題

2025.06.27

文献データベース

SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models

Authors: Dipayan Saha, Shams Tarek, Hasan Al Shaikh, Khan Thamid Hasan, Pavan Sai Nalluri, Md. Ajoad Hasan, Nashmin Alam, Jingbo Zhou, Sujan Kumar Saha, Mark Tehranipoor, Farimah Farahmandi | Published: 2025-06-25

セキュリティ検証手法

プロンプトインジェクション

大規模言語モデル

2025.06.25

文献データベース

Breaking the Boundaries of Long-Context LLM Inference: Adaptive KV Management on a Single Commodity GPU

Authors: He Sun, Li Li, Mingjun Xiao, Chengzhong Xu | Published: 2025-06-25

プロンプトインジェクション

メモリ管理手法

評価手法

2025.06.25

文献データベース

FuncVul: An Effective Function Level Vulnerability Detection Model using LLM and Code Chunk

Authors: Sajal Halder, Muhammad Ejaz Ahmed, Seyit Camtepe | Published: 2025-06-24

プロンプトインジェクション

大規模言語モデル

脆弱性研究

2025.06.24

文献データベース

Security Assessment of DeepSeek and GPT Series Models against Jailbreak Attacks

Authors: Xiaodong Wu, Xiangman Li, Jianbing Ni | Published: 2025-06-23

プロンプトインジェクション

モデルアーキテクチャ

大規模言語モデル

2025.06.23

文献データベース

Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases

Authors: Yubeen Bae, Minchan Kim, Jaejin Lee, Sangbum Kim, Jaehyung Kim, Yejin Choi, Niloofar Mireshghallah | Published: 2025-06-19 | Updated: 2025-07-01

プライバシー保護

プロンプトインジェクション

大規模言語モデル

2025.06.19

文献データベース

Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability

Authors: Shova Kuikel, Aritran Piplai, Palvi Aggarwal | Published: 2025-06-16

アライメント

プロンプトインジェクション

大規模言語モデル

2025.06.16

文献データベース