LLM性能評価

Towards a standardized methodology and dataset for evaluating LLM-based digital forensic timeline analysis

Authors: Hudan Studiawan, Frank Breitinger, Mark Scanlon | Published: 2025-05-06

LLM性能評価

大規模言語モデル

評価手法

2025.05.06

文献データベース

LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems

Authors: Yazan Otoum, Arghavan Asad, Amiya Nayak | Published: 2025-05-01

AIによる出力のバイアスの検出

LLM性能評価

プロンプトインジェクション

2025.05.01

文献データベース

Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs

Authors: Pan Suo, Yu-Ming Shang, San-Chuan Guo, Xi Zhang | Published: 2025-04-30

LLM性能評価

RAGへのポイズニング攻撃

攻撃タイプ

2025.04.30

文献データベース

Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code

Authors: Md. Azizul Hakim Bappy, Hossen A Mustafa, Prottoy Saha, Rajinus Salehat | Published: 2025-04-23

LLM性能評価

トレーニング手法

プロンプトリーキング

2025.04.23

文献データベース

aiXamine: LLM Safety and Security Simplified

Authors: Fatih Deniz, Dorde Popovic, Yazan Boshmaf, Euisuh Jeong, Minhaj Ahmad, Sanjay Chawla, Issa Khalil | Published: 2025-04-21

LLM性能評価

アライメント

パフォーマンス評価

2025.04.21

文献データベース

Watermarking Needs Input Repetition Masking

Authors: David Khachaturov, Robert Mullins, Ilia Shumailov, Sumanth Dathathri | Published: 2025-04-16

LLM性能評価

プロンプトの検証

透かし設計

2025.04.16

文献データベース

The Digital Cybersecurity Expert: How Far Have We Come?

Authors: Dawei Wang, Geng Zhou, Xianglong Li, Yu Bai, Li Chen, Ting Qin, Jian Sun, Dan Li | Published: 2025-04-16

LLM性能評価

RAGへのポイズニング攻撃

プロンプトインジェクション

2025.04.16

文献データベース

Progent: Programmable Privilege Control for LLM Agents

Authors: Tianneng Shi, Jingxuan He, Zhun Wang, Linyu Wu, Hongwei Li, Wenbo Guo, Dawn Song | Published: 2025-04-16

LLM性能評価

インダイレクトプロンプトインジェクション

プライバシー保護メカニズム

2025.04.16

文献データベース

Exploring Backdoor Attack and Defense for LLM-empowered Recommendations

Authors: Liangbo Ning, Wenqi Fan, Qing Li | Published: 2025-04-15

LLM性能評価

RAGへのポイズニング攻撃

敵対的攻撃分析

2025.04.15

文献データベース

Bypassing Prompt Injection and Jailbreak Detection in LLM Guardrails

Authors: William Hackett, Lewis Birch, Stefan Trawicki, Neeraj Suri, Peter Garraghan | Published: 2025-04-15

LLM性能評価

プロンプトインジェクション

敵対的攻撃分析

2025.04.15

文献データベース