攻撃手法 | ページ 3 | AIセキュリティポータル

Personalized Attacks of Social Engineering in Multi-turn Conversations — LLM Agents for Simulation and Detection

Authors: Tharindu Kumarage, Cameron Johnson, Jadie Adams, Lin Ai, Matthias Kirchner, Anthony Hoogs, Joshua Garland, Julia Hirschberg, Arslan Basharat, Huan Liu | Published: 2025-03-18

アライメント

ソーシャルエンジニアリング攻撃

攻撃手法

2025.03.18 2025.04.03

文献データベース

Anomaly-Flow: A Multi-domain Federated Generative Adversarial Network for Distributed Denial-of-Service Detection

Authors: Leonardo Henrique de Melo, Gustavo de Carvalho Bertoli, Michele Nogueira, Aldri Luiz dos Santos, Lourenço Alves Pereira Junior | Published: 2025-03-18

サイバー脅威

データ生成手法

攻撃手法

2025.03.18 2025.04.03

文献データベース

MirrorGuard: Adaptive Defense Against Jailbreaks via Entropy-Guided Mirror Crafting

Authors: Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang | Published: 2025-03-17

プロンプトインジェクション

大規模言語モデル

攻撃手法

2025.03.17 2025.04.03

文献データベース

Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents

Authors: Juhee Kim, Woohyuk Choi, Byoungyoung Lee | Published: 2025-03-17

インダイレクトプロンプトインジェクション

データ流分析

攻撃手法

2025.03.17 2025.04.03

文献データベース

BLIA: Detect model memorization in binary classification model through passive Label Inference attack

Authors: Mohammad Wahiduzzaman Khan, Sheng Chen, Ilya Mironov, Leizhen Zhang, Rabib Noor | Published: 2025-03-17

データキュレーション

差分プライバシー

攻撃手法

2025.03.17 2025.04.03

文献データベース

Winning the MIDST Challenge: New Membership Inference Attacks on Diffusion Models for Tabular Data Synthesis

Authors: Xiaoyu Wu, Yifei Pang, Terrance Liu, Steven Wu | Published: 2025-03-15

データ生成手法

メンバーシップ開示リスク

攻撃手法

2025.03.15 2025.04.03

文献データベース

Trust Under Siege: Label Spoofing Attacks against Machine Learning for Android Malware Detection

Authors: Tianwei Lan, Luca Demetrio, Farid Nait-Abdesselam, Yufei Han, Simone Aonzo | Published: 2025-03-14

バックドア攻撃

ラベル

攻撃手法

2025.03.14 2025.04.03

文献データベース

Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search

Authors: Andy Zhou | Published: 2025-03-13 | Updated: 2025-03-16

LLMの安全機構の解除

攻撃手法

生成モデル

2025.03.13 2025.04.03

文献データベース

Mind the Gap: Detecting Black-box Adversarial Attacks in the Making through Query Update Analysis

Authors: Jeonghwan Park, Niall McLaughlin, Ihsen Alouani | Published: 2025-03-04 | Updated: 2025-03-16

攻撃手法

敵対的サンプルの検知

深層学習

2025.03.04 2025.04.03

文献データベース

Can Indirect Prompt Injection Attacks Be Detected and Removed?

Authors: Yulin Chen, Haoran Li, Yuan Sui, Yufei He, Yue Liu, Yangqiu Song, Bryan Hooi | Published: 2025-02-23

プロンプトの検証

悪意のあるプロンプト

攻撃手法

2025.02.23 2025.04.03

文献データベース