Prompt Injection

Boosting Jailbreak Attack with Momentum

Authors: Yihao Zhang, Zeming Wei | Published: 2024-05-02

Watermarking

Prompt Injection

Attack Method

2024.05.02 2025.05.27

Literature Database

DLAP: A Deep Learning Augmented Large Language Model Prompting Framework for Software Vulnerability Detection

Authors: Yanjing Yang, Xin Zhou, Runfeng Mao, Jinwei Xu, Lanxin Yang, Yu Zhangm, Haifeng Shen, He Zhang | Published: 2024-05-02

Prompt Injection

Prompt Engineering

Vulnerability Management

2024.05.02 2025.05.27

Literature Database

LLM Security Guard for Code

Authors: Arya Kavian, Mohammad Mehdi Pourhashem Kallehbasti, Sajjad Kazemi, Ehsan Firouzi, Mohammad Ghafari | Published: 2024-05-02 | Updated: 2024-05-03

LLM Security

Security Analysis

Prompt Injection

2024.05.02 2025.05.27

Literature Database

Unleashing the Power of LLM to Infer State Machine from the Protocol Implementation

Authors: Haiyang Wei, Ligeng Chen, Zhengjie Du, Yuhan Wu, Haohui Huang, Yue Liu, Guang Cheng, Fengyuan Xu, Linzhang Wang, Bing Mao | Published: 2024-05-01 | Updated: 2025-03-27

LLM Performance Evaluation

Prompt Injection

State Transition Model

2024.05.01 2025.05.27

Literature Database

TuBA: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning

Authors: Xuanli He, Jun Wang, Qiongkai Xu, Pasquale Minervini, Pontus Stenetorp, Benjamin I. P. Rubinstein, Trevor Cohn | Published: 2024-04-30 | Updated: 2025-03-17

Content Moderation

Backdoor Attack

Prompt Injection

2024.04.30 2025.05.27

Literature Database

Evaluating and Mitigating Linguistic Discrimination in Large Language Models

Authors: Guoliang Dong, Haoyu Wang, Jun Sun, Xinyu Wang | Published: 2024-04-29 | Updated: 2024-05-10

LLM Performance Evaluation

Bias

Prompt Injection

2024.04.29 2025.05.27

Literature Database

Attacks on Third-Party APIs of Large Language Models

Authors: Wanru Zhao, Vidit Khazanchi, Haodi Xing, Xuanli He, Qiongkai Xu, Nicholas Donald Lane | Published: 2024-04-24

LLM Security

Prompt Injection

Attack Method

2024.04.24 2025.05.27

Literature Database

Act as a Honeytoken Generator! An Investigation into Honeytoken Generation with Large Language Models

Authors: Daniel Reti, Norman Becker, Tillmann Angeli, Anasuya Chattopadhyay, Daniel Schneider, Sebastian Vollmer, Hans D. Schotten | Published: 2024-04-24

LLM Performance Evaluation

Honeypot Technology

Prompt Injection

2024.04.24 2025.05.27

Literature Database

zkLLM: Zero Knowledge Proofs for Large Language Models

Authors: Haochen Sun, Jason Li, Hongyang Zhang | Published: 2024-04-24

Prompt Injection

Computational Efficiency

Watermark Robustness

2024.04.24 2025.05.27

Literature Database

Protecting Your LLMs with Information Bottleneck

Authors: Zichuan Liu, Zefan Wang, Linjie Xu, Jinyu Wang, Lei Song, Tianchun Wang, Chunlin Chen, Wei Cheng, Jiang Bian | Published: 2024-04-22 | Updated: 2024-10-10

LLM Security

Prompt Injection

Compliance with Ethical Guidelines

2024.04.22 2025.05.27

Literature Database