Prompt Injection

PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context Optimization

Authors: Yidan Wang, Yanan Cao, Yubing Ren, Fang Fang, Zheng Lin, Binxing Fang | Published: 2025-05-15

Disabling Safety Mechanisms of LLM

Prompt Injection

Privacy Protection in Machine Learning

2025.05.15 2025.05.28

Literature Database

SecReEvalBench: A Multi-turned Security Resilience Evaluation Benchmark for Large Language Models

Authors: Huining Cui, Wei Liu | Published: 2025-05-12

LLM Security

Prompt Injection

Prompt leaking

2025.05.12 2025.05.28

Literature Database

Security through the Eyes of AI: How Visualization is Shaping Malware Detection

Authors: Asmitha K. A., Matteo Brosolo, Serena Nicolazzo, Antonino Nocera, Vinod P., Rafidha Rehiman K. A., Muhammed Shafi K. P | Published: 2025-05-12

Prompt Injection

Malware Classification

Adversarial Example Detection

2025.05.12 2025.05.28

Literature Database

One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models

Authors: Haoran Gu, Handing Wang, Yi Mei, Mengjie Zhang, Yaochu Jin | Published: 2025-05-12

LLM Security

Disabling Safety Mechanisms of LLM

Prompt Injection

2025.05.12 2025.05.28

Literature Database

Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs

Authors: Chetan Pathade | Published: 2025-05-07 | Updated: 2025-05-13

LLM Security

Disabling Safety Mechanisms of LLM

Prompt Injection

2025.05.07 2025.05.28

Literature Database

Safeguard-by-Development: A Privacy-Enhanced Development Paradigm for Multi-Agent Collaboration Systems

Authors: Jian Cui, Zichuan Li, Luyi Xing, Xiaojing Liao | Published: 2025-05-07 | Updated: 2025-06-24

Privacy Protection

Privacy protection framework

Prompt Injection

2025.05.07 2025.06.26

Literature Database

LlamaFirewall: An open source guardrail system for building secure AI agents

Authors: Sahana Chennabasappa, Cyrus Nikolaidis, Daniel Song, David Molnar, Stephanie Ding, Shengye Wan, Spencer Whitman, Lauren Deason, Nicholas Doucette, Abraham Montilla, Alekhya Gampa, Beto de Paola, Dominik Gabi, James Crnkovich, Jean-Christophe Testud, Kat He, Rashnil Chaturvedi, Wu Zhou, Joshua Saxe | Published: 2025-05-06

LLM Security

Alignment

Prompt Injection

2025.05.06 2025.05.27

Literature Database

Directed Greybox Fuzzing via Large Language Model

Authors: Hanxiang Xu, Yanjie Zhao, Haoyu Wang | Published: 2025-05-06

RAG

Prompt Injection

Vulnerability Analysis

2025.05.06 2025.05.27

Literature Database

LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems

Authors: Yazan Otoum, Arghavan Asad, Amiya Nayak | Published: 2025-05-01 | Updated: 2025-05-13

Bias Detection in AI Output

LLM Performance Evaluation

Prompt Injection

2025.05.01 2025.05.27

Literature Database

An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding

Authors: Xiuwei Shang, Zhenkan Fu, Shaoyin Cheng, Guoqiang Chen, Gangyang Li, Li Hu, Weiming Zhang, Nenghai Yu | Published: 2025-04-30

Program Analysis

Prompt Injection

Prompt leaking

2025.04.30 2025.05.27

Literature Database