AI Security Portal bot | Page 41

Hybrid Fuzzing with LLM-Guided Input Mutation and Semantic Feedback

Authors: Shiyin Lin | Published: 2025-11-06

Prompt Injection

Dynamic Analysis

Information Security

2025.11.06 2025.11.08

Literature Database

Whisper Leak: a side-channel attack on Large Language Models

Authors: Geoff McDonald, Jonathan Bar Or | Published: 2025-11-05

Traffic Characteristic Analysis

Prompt Leaking

Large Language Model

2025.11.05 2025.11.07

Literature Database

Watermarking Large Language Models in Europe: Interpreting the AI Act in Light of Technology

Authors: Thomas Souverain | Published: 2025-11-05

Digital Watermarking for Generative AI

Generative Model Characteristics

Transparency and Verification

2025.11.05 2025.11.07

Literature Database

Let the Bees Find the Weak Spots: A Path Planning Perspective on Multi-Turn Jailbreak Attacks against LLMs

Authors: Yize Liu, Yunyun Hou, Aina Sui | Published: 2025-11-05

Automation of Cybersecurity

Prompt Injection

Multi-Turn Attack Analysis

2025.11.05 2025.11.07

Literature Database

Auditing M-LLMs for Privacy Risks: A Synthetic Benchmark and Evaluation Framework

Authors: Junhao Li, Jiahao Chen, Zhou Feng, Chunyi Zhou | Published: 2025-11-05

Hallucination

Privacy Violation

Privacy Protection

2025.11.05 2025.11.07

Literature Database

Death by a Thousand Prompts: Open Model Vulnerability Analysis

Authors: Amy Chang, Nicholas Conley, Harish Santhanalakshmi Ganesan, Adam Swanda | Published: 2025-11-05

Disabling Safety Mechanisms of LLM

Indirect Prompt Injection

Threat Modeling

2025.11.05 2025.11.07

Literature Database

Measuring the Security of Mobile LLM Agents under Adversarial Prompts from Untrusted Third-Party Channels

Authors: Chenghao Du, Quanfeng Huang, Tingxuan Tang, Zihao Wang, Adwait Nadkarni, Yue Xiao | Published: 2025-10-31 | Updated: 2025-11-06

Indirect Prompt Injection

Prompt Injection

Information Security

2025.10.31 2025.11.08

Literature Database

PVMark: Enabling Public Verifiability for LLM Watermarking Schemes

Authors: Haohua Duan, Liyao Xiang, Xin Zhang | Published: 2025-10-30

Model Extraction Attack

Public Verifiability

Watermarking Technology

2025.10.30 2025.11.01

Literature Database

ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio-Language Models

Authors: Weifei Jin, Yuxin Cao, Junjie Su, Minhui Xue, Jie Hao, Ke Xu, Jin Song Dong, Derui Wang | Published: 2025-10-30

Prompt Injection

Impact of Generalization

Compliance with Ethical Standards

2025.10.30 2025.11.01

Literature Database

Model Inversion Attacks Meet Cryptographic Fuzzy Extractors

Authors: Mallika Prabhakar, Louise Xu, Prateek Saxena | Published: 2025-10-29

Membership Inference

Model Inversion

Defense Method

2025.10.29 2025.10.31

Literature Database