LLM Performance Evaluation

Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models

Authors: Bang An, Sicheng Zhu, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, Furong Huang | Published: 2024-09-01
LLM Performance Evaluation
Content Moderation
Prompt Injection

LeCov: Multi-level Testing Criteria for Large Language Models

Authors: Xuan Xie, Jiayang Song, Yuheng Huang, Da Song, Fuyuan Zhang, Felix Juefei-Xu, Lei Ma | Published: 2024-08-20
LLM Performance Evaluation
Test Prioritization
Prompt Injection

Tracing Privacy Leakage of Language Models to Training Data via Adjusted Influence Functions

Authors: Jinxin Liu, Zao Yang | Published: 2024-08-20 | Updated: 2024-09-05
LLM Performance Evaluation
Privacy Protection Method
Evaluation Method

Large Language Models for Secure Code Assessment: A Multi-Language Empirical Study

Authors: Kohei Dozono, Tiago Espinha Gasiba, Andrea Stocco | Published: 2024-08-12
LLM Performance Evaluation
Prompt Injection
Vulnerability Management

A RAG-Based Question-Answering Solution for Cyber-Attack Investigation and Attribution

Authors: Sampath Rajapaksha, Ruby Rani, Erisa Karafili | Published: 2024-08-12
LLM Performance Evaluation
RAG
Cybersecurity

Multimodal Large Language Models for Phishing Webpage Detection and Identification

Authors: Jehyun Lee, Peiyuan Lim, Bryan Hooi, Dinil Mon Divakaran | Published: 2024-08-12
LLM Performance Evaluation
Phishing Detection
Prompt Injection

AttackER: Towards Enhancing Cyber-Attack Attribution with a Named Entity Recognition Dataset

Authors: Pritam Deka, Sampath Rajapaksha, Ruby Rani, Amirah Almutairi, Erisa Karafili | Published: 2024-08-09
LLM Performance Evaluation
Cybersecurity
Prompt Injection

Towards Explainable Network Intrusion Detection using Large Language Models

Authors: Paul R. B. Houssel, Priyanka Singh, Siamak Layeghy, Marius Portmann | Published: 2024-08-08
LLM Performance Evaluation
Network Threat Detection
Prompt Injection

MPC-Minimized Secure LLM Inference

Authors: Deevashwer Rathee, Dacheng Li, Ion Stoica, Hao Zhang, Raluca Popa | Published: 2024-08-07
LLM Performance Evaluation
MPC Algorithm
Model Performance Evaluation

Harnessing the Power of LLMs in Source Code Vulnerability Detection

Authors: Andrew A Mahyari | Published: 2024-08-07
LLM Performance Evaluation
Program Analysis
Vulnerability Management