AIセキュリティポータルbot

Position: LLM Watermarking Should Align Stakeholders’ Incentives for Practical Adoption

Authors: Yepeng Liu, Xuandong Zhao, Dawn Song, Gregory W. Wornell, Yuheng Bu | Published: 2025-10-21
Incentive Mechanism
Digital Watermarking for Generative AI
Robustness of Watermarking Techniques

RESCUE: Retrieval Augmented Secure Code Generation

Authors: Jiahao Shi, Tianyi Zhang | Published: 2025-10-21
Poisoning attack on RAG
Data-Driven Vulnerability Assessment
Prompt leaking

VERA-V: Variational Inference Framework for Jailbreaking Vision-Language Models

Authors: Qilin Liao, Anamika Lochab, Ruqi Zhang | Published: 2025-10-20
Model DoS
Large Language Model
Untargeted Toxicity Attack

CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks

Authors: Xu Zhang, Hao Li, Zhichao Lu | Published: 2025-10-20
Query Generation Method
Prompt Injection
Large Language Model

GUIDE: Enhancing Gradient Inversion Attacks in Federated Learning with Denoising Models

Authors: Vincenzo Carletti, Pasquale Foggia, Carlo Mazzocca, Giuseppe Parrella, Mario Vento | Published: 2025-10-20
Privacy Analysis
Reconstruction Attack
Federated Learning

Multimodal Safety Is Asymmetric: Cross-Modal Exploits Unlock Black-Box MLLMs Jailbreaks

Authors: Xinkai Wang, Beibei Li, Zerui Shao, Ao Liu, Shouling Ji | Published: 2025-10-20
Disabling Safety Mechanisms of LLM
Prompt Injection
Malicious Content Generation

Exploiting the Potential of Linearity in Automatic Differentiation and Computational Cryptography

Authors: Giulia Giusti | Published: 2025-10-20
Encryption Technology
Watermark Design
Quantum Computing Method

QRïS: A Preemptive Novel Method for Quishing Detection Through Structural Features of QR

Authors: Muhammad Wahid Akram, Keshav Sood, Muneeb Ul Hassan | Published: 2025-10-20
QRコード分類手法
Feature Importance Analysis
評価メトリクス

SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection

Authors: Yang Feng, Xudong Pan | Published: 2025-10-17 | Updated: 2025-10-21
エージェント設計
Network Threat Detection
Model Robustness

SoK: Taxonomy and Evaluation of Prompt Security in Large Language Models

Authors: Hanbin Hong, Shuya Feng, Nima Naderloui, Shenao Yan, Jingyu Zhang, Biying Liu, Ali Arastehfard, Heqing Huang, Yuan Hong | Published: 2025-10-17 | Updated: 2025-10-21
LLM Security
シナリオベースの悪用
Large Language Model