AIセキュリティポータルbot

TZ-LLM: Protecting On-Device Large Language Models with Arm TrustZone

Authors: Xunjie Wang, Jiacheng Shi, Zihan Zhao, Yang Yu, Zhichao Hua, Jinyu Gu | Published: 2025-11-17
プロンプトリーキング
モデルDoS
性能評価指標

Robust Client-Server Watermarking for Split Federated Learning

Authors: Jiaxiong Tang, Zhengchunmin Dai, Liantao Wu, Peng Sun, Honglong Chen, Zhenfu Cao | Published: 2025-11-17
トリガーの検知
プライバシー手法
透かし評価

ForgeDAN: An Evolutionary Framework for Jailbreaking Aligned Large Language Models

Authors: Siyang Cheng, Gaotian Liu, Rui Mei, Yilin Wang, Kejia Zhang, Kaishuo Wei, Yuqi Yu, Weiping Wen, Xiaojie Wu, Junhua Liu | Published: 2025-11-17
プロンプトインジェクション
大規模言語モデル
進化的アルゴリズム

Interpretable Ransomware Detection Using Hybrid Large Language Models: A Comparative Analysis of BERT, RoBERTa, and DeBERTa Through LIME and SHAP

Authors: Elodie Mutombo Ngoie, Mike Nkongolo Wa Nkongolo, Peace Azugo, Mahmut Tokmak | Published: 2025-11-17
メンバーシップ推論
深層学習に基づくIDS
特徴選択手法

Tight and Practical Privacy Auditing for Differentially Private In-Context Learning

Authors: Yuyang Xia, Ruixuan Liu, Li Xiong | Published: 2025-11-17
プライバシー手法
匿名化技術
差分プライバシー

Enhancing All-to-X Backdoor Attacks with Optimized Target Class Mapping

Authors: Lei Wang, Yulong Tian, Hao Han, Fengyuan Xu | Published: 2025-11-17
トリガーの検知
バックドア攻撃
透かし評価

Whistledown: Combining User-Level Privacy with Conversational Coherence in LLMs

Authors: Chelsea McMurray, Hayder Tirmazi | Published: 2025-11-17
プライバシーリスク管理
プライバシー保証
プライバシー手法

DualTAP: A Dual-Task Adversarial Protector for Mobile MLLM Agents

Authors: Fuyao Zhang, Jiaming Zhang, Che Wang, Xiongtao Sun, Yurong Hao, Guowei Guan, Wenjie Li, Longtao Huang, Wei Yang Bryan Lim | Published: 2025-11-17
プライバシー手法
生成モデル
透かし評価

SmartPoC: Generating Executable and Validated PoCs for Smart Contract Bug Reports

Authors: Longfei Chen, Ruibin Yan, Taiyu Wong, Yiyang Chen, Chao Zhang | Published: 2025-11-17
性能評価指標
自動生成フレームワーク
透かし評価

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Authors: Xuankun Rong, Wenke Huang, Tingfeng Wang, Daiguo Zhou, Bo Du, Mang Ye | Published: 2025-11-17
プライバシー手法
不適切コンテンツ生成
倫理的選択評価