Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions Authors: Qinnan Hu, Yuntao Wang, Yuan Gao, Zhou Su, Linkang Du | Published: 2025-09-11 AIシステムの関係性倫理基準遵守異常検知手法 2025.09.11 文献データベース
Confusion is the Final Barrier: Rethinking Jailbreak Evaluation and Investigating the Real Misuse Threat of LLMs Authors: Yu Yan, Sheng Sun, Zhe Wang, Yijun Lin, Zenghao Duan, zhifei zheng, Min Liu, Zhiyi yin, Jianping Zhang | Published: 2025-08-22 | Updated: 2025-09-15 プライバシー評価倫理基準遵守大規模言語モデル 2025.08.22 文献データベース
Rethinking Exact Unlearning under Exposure: Extracting Forgotten Data under Exact Unlearning in Large Language Model Authors: Xiaoyu Wu, Yifei Pang, Terrance Liu, Zhiwei Steven Wu | Published: 2025-05-30 | Updated: 2025-10-06 プライバシー保護機械学習プライバシー損失分析倫理基準遵守 2025.05.30 文献データベース
Adversarial Suffix Filtering: a Defense Pipeline for LLMs Authors: David Khachaturov, Robert Mullins | Published: 2025-05-14 プロンプトの検証倫理基準遵守攻撃検出手法 2025.05.14 文献データベース