LLM性能評価

Prompt Control-Flow Integrity: A Priority-Aware Runtime Defense Against Prompt Injection in LLM Systems

Authors: Md Takrim Ul Alam, Akif Islam, Mohd Ruhul Ameen, Abu Saleh Musa Miah, Jungpil Shin | Published: 2026-03-19
LLM性能評価
インダイレクトプロンプトインジェクション
評価手法

Network- and Device-Level Cyber Deception for Contested Environments Using RL and LLMs

Authors: Abhijeet Sahu, Shuva Paul, Rochard Macwan | Published: 2026-03-18
LLM性能評価
RAGへのポイズニング攻撃
評価手法

TOSSS: a CVE-based Software Security Benchmark for Large Language Models

Authors: Marc Damie, Murat Bilgehan Ertan, Domenico Essoussi, Angela Makhanu, Gaëtan Peter, Roos Wensveen | Published: 2026-03-11
LLM性能評価
プロンプトインジェクション
評価手法

CacheSolidarity: Preventing Prefix Caching Side Channels in Multi-tenant LLM Serving Systems

Authors: Panagiotis Georgios Pennas, Konstantinos Papaioannou, Marco Guarnieri, Thaleia Dimitra Doudali | Published: 2026-03-11
LLM性能評価
プロンプトインジェクション
プロンプトリーキング

CyberThreat-Eval: Can Large Language Models Automate Real-World Threat Research?

Authors: Xiangsen Chen, Xuan Feng, Shuo Chen, Matthieu Maitre, Sudipto Rakshit, Diana Duvieilh, Ashley Picone, Nan Tang | Published: 2026-03-10
LLMの安全機構の解除
LLM性能評価
インダイレクトプロンプトインジェクション

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

Authors: Yinpeng Wu, Yitong Chen, Lixiang Wang, Jinyu Gu, Zhichao Hua, Yubin Xia | Published: 2026-03-10
LLM性能評価
セキュアメモリ設計
大規模言語モデル

SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement

Authors: Yi Chen, Yun Bian, Haiquan Wang, Shihao Li, Zhe Cui | Published: 2026-03-09
LLM性能評価
プログラム解析
脆弱性評価

Good-Enough LLM Obfuscation (GELO)

Authors: Anatoly Belikov, Ilya Fedotov | Published: 2026-03-05
LLM性能評価
データプライバシー管理

A Multi-Dimensional Quality Scoring Framework for Decentralized LLM Inference with Proof of Quality

Authors: Arther Tian, Alex Ding, Frank Chen, Simon Wu, Aaron Chan | Published: 2026-03-04
LLM性能評価
アライメント
評価指標

When Safety Becomes a Vulnerability: Exploiting LLM Alignment Homogeneity for Transferable Blocking in RAG

Authors: Junchen Li, Chao Qi, Rongzheng Wang, Qizhi Chen, Liang Xu, Di Liang, Bob Simons, Shuang Liang | Published: 2026-03-04
LLM性能評価
RAG
RAGへのポイズニング攻撃