LLM性能評価

From Generalist to Specialist: Exploring CWE-Specific Vulnerability Detection

Authors: Syafiq Al Atiiq, Christian Gehrmann, Kevin Dahlén, Karim Khalil | Published: 2024-08-05

LLM性能評価

モデル性能評価

脆弱性管理

2024.08.05 2025.04.03

文献データベース

LLM as Runtime Error Handler: A Promising Pathway to Adaptive Self-Healing of Software Systems

Authors: Zhensu Sun, Haotian Zhu, Bowen Xu, Xiaoning Du, Li Li, David Lo | Published: 2024-08-02

LLM性能評価

プログラム解析

自己修復システム

2024.08.02 2025.04.03

文献データベース

GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory

Authors: Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song | Published: 2024-06-17 | Updated: 2024-10-04

LLM性能評価

プライバシー保護手法

プロンプトインジェクション

2024.06.17 2025.04.03

文献データベース

LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

Authors: Hongxiang Zhang, Yuyang Rong, Yifeng He, Hao Chen | Published: 2024-06-11 | Updated: 2024-06-13

LLM性能評価

ファジング

プロンプトインジェクション

2024.06.11 2025.04.03

文献データベース

Beyond Words: On Large Language Models Actionability in Mission-Critical Risk Analysis

Authors: Matteo Esposito, Francesco Palagiano, Valentina Lenarduzzi, Davide Taibi | Published: 2024-06-11 | Updated: 2024-09-06

LLM性能評価

RAG

リスク管理

2024.06.11 2025.04.03

文献データベース

VulDetectBench: Evaluating the Deep Capability of Vulnerability Detection with Large Language Models

Authors: Yu Liu, Lang Gao, Mingxin Yang, Yu Xie, Ping Chen, Xiaojin Zhang, Wei Chen | Published: 2024-06-11 | Updated: 2024-08-21

LLM性能評価

モデル性能評価

脆弱性管理

2024.06.11 2025.04.03

文献データベース

MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models

Authors: Tianle Gu, Zeyang Zhou, Kexin Huang, Dandan Liang, Yixu Wang, Haiquan Zhao, Yuanqi Yao, Xingge Qiao, Keqing Wang, Yujiu Yang, Yan Teng, Yu Qiao, Yingchun Wang | Published: 2024-06-11 | Updated: 2024-06-13

LLM性能評価

データセット生成

評価手法

2024.06.11 2025.04.03

文献データベース

Ollabench: Evaluating LLMs’ Reasoning for Human-centric Interdependent Cybersecurity

Authors: Tam n. Nguyen | Published: 2024-06-11

LLM性能評価

サイバーセキュリティ

評価手法

2024.06.11 2025.04.03

文献データベース

SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection

Authors: Sakshi Mahendru, Tejul Pandit | Published: 2024-06-10

LLM性能評価

フィッシング検出

プロンプトインジェクション

2024.06.10 2025.04.03

文献データベース

A Novel Generative AI-Based Framework for Anomaly Detection in Multicast Messages in Smart Grid Communications

Authors: Aydin Zaboli, Seong Lok Choi, Tai-Jin Song, Junho Hong | Published: 2024-06-08

LLM性能評価

サイバーセキュリティ

異常検出手法

2024.06.08 2025.04.03

文献データベース