SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic Embeddings Authors: Weikai Lu, Hao Peng, Huiping Zhuang, Cen Chen, Ziqian Zeng | Published: 2025-02-18 | Updated: 2025-05-21 アライメントテキスト生成手法プロンプトインジェクション 2025.02.18 文献データベース
Toward Integrated Solutions: A Systematic Interdisciplinary Review of Cybergrooming Research Authors: Heajun An, Marcos Silva, Qi Zhang, Arav Singh, Minqian Liu, Xinyi Zhang, Sarvech Qadir, Sang Won Lee, Lifu Huang, Pamela J. Wisniewski, Jin-Hee Cho | Published: 2025-02-18 | Updated: 2025-07-31 サイバーグルーミング研究敵対的学習文献レビュー方法論 2025.02.18 文献データベース
Unveiling Privacy Risks in LLM Agent Memory Authors: Bo Wang, Weiyi He, Shenglai Zeng, Zhen Xiang, Yue Xing, Jiliang Tang, Pengfei He | Published: 2025-02-17 | Updated: 2025-06-03 プライバシー分析プロンプトリーキング情報漏洩の原因 2025.02.17 文献データベース
BackdoorDM: A Comprehensive Benchmark for Backdoor Learning on Diffusion Model Authors: Weilin Lin, Nanjun Zhou, Yanyun Wang, Jianze Li, Hui Xiong, Li Liu | Published: 2025-02-17 | Updated: 2025-07-21 トリガーの検知バックドア攻撃性能評価 2025.02.17 文献データベース
DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing Authors: Yi Wang, Fenghua Weng, Sibei Yang, Zhan Qin, Minlie Huang, Wenjie Wang | Published: 2025-02-17 | Updated: 2025-05-29 LLMセキュリティプロンプトインジェクション防御手法 2025.02.17 文献データベース
Nuclear Deployed: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents Authors: Rongwu Xu, Xiaojian Li, Shuo Chen, Wei Xu | Published: 2025-02-17 | Updated: 2025-03-23 インダイレクトプロンプトインジェクション倫理声明意思決定ダイナミクス 2025.02.17 2025.04.03 文献データベース
QueryAttack: Jailbreaking Aligned Large Language Models Using Structured Non-natural Query Language Authors: Qingsong Zou, Jingyu Xiao, Qing Li, Zhi Yan, Yuhang Wang, Li Xu, Wenxuan Wang, Kuofeng Gao, Ruoyu Li, Yong Jiang | Published: 2025-02-13 | Updated: 2025-05-26 LLMの安全機構の解除プロンプトリーキング教育的分析 2025.02.13 文献データベース
A hierarchical approach for assessing the vulnerability of tree-based classification models to membership inference attack Authors: Richard J. Preen, Jim Smith | Published: 2025-02-13 | Updated: 2025-06-12 プライバシー保護手法モデル抽出攻撃リスク評価 2025.02.13 文献データベース
RLSA-PFL: Robust Lightweight Secure Aggregation with Model Inconsistency Detection in Privacy-Preserving Federated Learning Authors: Nazatul H. Sultan, Yan Bo, Yansong Gao, Seyit Camtepe, Arash Mahboubi, Hang Thanh Bui, Aufeef Chauhan, Hamed Aboutorab, Michael Bewong, Dineshkumar Singh, Praveen Gauravaram, Rafiqul Islam, Sharif Abuadbba | Published: 2025-02-13 | Updated: 2025-04-16 プライバシー保護プロトコル性能評価手法連合学習 2025.02.13 文献データベース
RESIST: Resilient Decentralized Learning Using Consensus Gradient Descent Authors: Cheng Fang, Rishabh Dixit, Waheed U. Bajwa, Mert Gurbuzbalaban | Published: 2025-02-11 MITM攻撃収束分析 2025.02.11 2025.04.03 文献データベース