防御手法 | AIセキュリティポータル

Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs

Authors: Tengyun Ma, Jiaqi Yao, Daojing He, Shihao Peng, Yu Li, Shaohui Liu, Zhuotao Tian | Published: 2025-12-03

セキュリティ考慮

倫理的考慮

防御手法

2025.12.03

文献データベース

Model Inversion Attacks Meet Cryptographic Fuzzy Extractors

Authors: Mallika Prabhakar, Louise Xu, Prateek Saxena | Published: 2025-10-29

メンバーシップ推論

モデルインバージョン

防御手法

2025.10.29

文献データベース

NetEcho: From Real-World Streaming Side-Channels to Full LLM Conversation Recovery

Authors: Zheng Zhang, Guanlong Wu, Sen Deng, Shuai Wang, Yinqian Zhang | Published: 2025-10-29

ネットワークトラフィック分析

モデル抽出攻撃

防御手法

2025.10.29

文献データベース

An In-Depth Analysis of Cyber Attacks in Secured Platforms

Authors: Parick Ozoh, John K Omoniyi, Bukola Ibitoye | Published: 2025-10-29

サイバー脅威

プライバシー漏洩

防御手法

2025.10.29

文献データベース

Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs

Authors: Man Hu, Xinyi Wu, Zuofeng Suo, Jinbo Feng, Linghui Meng, Yanhao Jia, Anh Tuan Luu, Shuai Zhao | Published: 2025-10-09

プロンプトリーキング

推論に基づくバックドア攻撃

防御手法

2025.10.09

文献データベース

DDoS Attacks in Cloud Computing: Detection and Prevention

Authors: Zain Ahmad, Musab Ahmad, Bilal Ahmad | Published: 2025-08-19

リソース使用分析

攻撃タイプ

防御手法

2025.08.19

文献データベース

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security

Authors: Muzhi Dai, Shixuan Liu, Zhiyuan Zhao, Junyu Gao, Hao Sun, Xuelong Li | Published: 2025-07-29

報酬メカニズム設計

強化学習最適化

防御手法

2025.07.29

文献データベース

Thought Purity: Defense Paradigm For Chain-of-Thought Attack

Authors: Zihao Xue, Zhen Bi, Long Ma, Zhenlin Hu, Yan Wang, Zhenfang Liu, Qing Sheng, Jie Xiao, Jungang Lou | Published: 2025-07-16

情報セキュリティ

脅威モデリング

防御手法

2025.07.16

文献データベース

Defending Against Prompt Injection With a Few DefensiveTokens

Authors: Sizhe Chen, Yizhu Wang, Nicholas Carlini, Chawin Sitawarin, David Wagner | Published: 2025-07-10

インダイレクトプロンプトインジェクション

プロンプトリーキング

防御手法

2025.07.10

文献データベース

May I have your Attention? Breaking Fine-Tuning based Prompt Injection Defenses using Architecture-Aware Attacks

Authors: Nishit V. Pandya, Andrey Labunets, Sicun Gao, Earlence Fernandes | Published: 2025-07-10

インダイレクトプロンプトインジェクション

敵対的攻撃

防御手法

2025.07.10

文献データベース