LLM性能評価

PR-Attack: Coordinated Prompt-RAG Attacks on Retrieval-Augmented Generation in Large Language Models via Bilevel Optimization

Authors: Yang Jiao, Xiaodong Wang, Kai Yang | Published: 2025-04-10
LLM性能評価
RAGへのポイズニング攻撃
敵対的攻撃評価

TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation

Authors: Tianyu Cui, Xinjie Lin, Sijia Li, Miao Chen, Qilei Yin, Qi Li, Ke Xu | Published: 2025-04-05 | Updated: 2025-04-15
LLM性能評価
タスク特化型チューニング
モデルの堅牢性

On Benchmarking Code LLMs for Android Malware Analysis

Authors: Yiling He, Hongyu She, Xingzhi Qian, Xinran Zheng, Zhuo Chen, Zhan Qin, Lorenzo Cavallaro | Published: 2025-04-01 | Updated: 2025-04-23
LLM性能評価
マルウェア検出手法
研究方法論

Queueing, Predictions, and LLMs: Challenges and Open Problems

Authors: Michael Mitzenmacher, Rana Shahout | Published: 2025-03-10
LLM性能評価
スケジューリング手法
予測に基づくスケジューリング

AgentBreeder: Mitigating the AI Safety Impact of Multi-Agent Scaffolds via Self-Improvement

Authors: J Rosser, Jakob Nicolaus Foerster | Published: 2025-02-02 | Updated: 2025-04-14
LLM性能評価
マルチオブジェクティブ最適化
安全性アライメント

Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities

Authors: ZeKe Xiao, Qin Wang, Hammond Pearce, Shiping Chen | Published: 2025-01-13
LLM性能評価
サイバーセキュリティ
スマートコントラクト

MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference

Authors: Wenxuan Zeng, Ye Dong, Jinjin Zhou, Junming Ma, Jin Tan, Runsheng Wang, Meng Li | Published: 2025-01-12
LLM性能評価
MPCアルゴリズム
トークン収集手法

Automating the Detection of Code Vulnerabilities by Analyzing GitHub Issues

Authors: Daniele Cipollone, Changjie Wang, Mariano Scazzariello, Simone Ferlin, Maliheh Izadi, Dejan Kostic, Marco Chiesa | Published: 2025-01-09
LLM性能評価
プロンプトインジェクション
脆弱性管理

SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs

Authors: Muhammad Salman, Muhammad Ikram, Nardine Basta, Mohamed Ali Kaafar | Published: 2025-01-09
LLM性能評価
プロンプトインジェクション
学習の改善

LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models

Authors: Mohamad Fakih, Rahul Dharmaji, Halima Bouzidi, Gustavo Quiros Araya, Oluwatosin Ogundare, Mohammad Abdullah Al Faruque | Published: 2025-01-07
LLM性能評価
プロンプトエンジニアリング
自動脆弱性修復