AI Security Portal bot | Page 51 | AI Security Portal

LLM-Assisted Web Measurements

Authors: Simone Bozzolan, Stefano Calzavara, Lorenzo Cazzaro | Published: 2025-10-09

Detection of bias in AI output

App classification methods

Prompt injection

2025.10.09

Literature Database

A Novel Ensemble Learning Approach for Enhanced IoT Attack Detection: Redefining Security Paradigms in Connected Systems

Authors: Hikmat A. M. Abdeljaber, Md. Alamgir Hossain, Sultan Ahmad, Ahmed Alsanad, Md Alimul Haque, Sudan Jha, Jabeen Nazeer | Published: 2025-10-09

IoT security challenges

Defense mechanisms

Defense effectiveness analysis

2025.10.09

Literature Database

Fewer Weights, More Problems: A Practical Attack on LLM Pruning

Authors: Kazuki Egashira, Robin Staab, Thibaud Gloaguen, Mark Vero, Martin Vechev | Published: 2025-10-09

Security analysis methods

Prompt injection

Defense effectiveness analysis

2025.10.09

Literature Database

From Defender to Devil? Unintended Risk Interactions Induced by LLM Defenses

Authors: Xiangtao Meng, Tianshuo Cong, Li Wang, Wenyu Chen, Zheng Li, Shanqing Guo, Xiaoyun Wang | Published: 2025-10-09

Alignment

Indirect prompt injection

Defense effectiveness analysis

2025.10.09

Literature Database

MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation

Authors: Weisen Jiang, Sinno Jialin Pan | Published: 2025-10-09

Prompt injection

Robustness

Defense mechanisms

2025.10.09

Literature Database

Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs

Authors: Man Hu, Xinyi Wu, Zuofeng Suo, Jinbo Feng, Linghui Meng, Yanhao Jia, Anh Tuan Luu, Shuai Zhao | Published: 2025-10-09

Prompt leaking

Reasoning-based backdoor attacks

Defense methods

2025.10.09

Literature Database

Proactive defense against LLM Jailbreak

Authors: Weiliang Zhao, Jinjun Peng, Daniel Ben-Levi, Zhou Yu, Junfeng Yang | Published: 2025-10-06

Disabling LLM safety mechanisms

Prompt injection

Integration of defense methods

2025.10.06

Literature Database

What your brain activity says about you: A review of neuropsychiatric disorders identified in resting-state and sleep EEG data

Authors: J. E. M. Scanlon, A. Pelzer, M. Gharleghi, K. C. Fuhrmeister, T. Köllmer, P. Aichroth, R. Göder, C. Hansen, K. I. Wolf | Published: 2025-10-06

Privacy-preserving machine learning

Signal processing

Medical diagnostic attributes

2025.10.06

Literature Database

Federated Computation of ROC and PR Curves

Authors: Xuefeng Xu, Graham Cormode | Published: 2025-10-06

Trade-off analysis

Privacy-preserving machine learning

Approximation error for negative inputs

2025.10.06

Literature Database

Unified Threat Detection and Mitigation Framework (UTDMF): Combating Prompt Injection, Deception, and Bias in Enterprise-Scale Transformers

Authors: Santhosh Kumar Ravindran | Published: 2025-10-06

Indirect prompt injection

Bias mitigation methods

Integration of defense methods

2025.10.06

Literature Database