Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions Authors: Qinnan Hu, Yuntao Wang, Yuan Gao, Zhou Su, Linkang Du | Published: 2025-09-11 Relationship of AI Systems倫理基準遵守Anomaly Detection Method 2025.09.11 2025.09.13 Literature Database
Confusion is the Final Barrier: Rethinking Jailbreak Evaluation and Investigating the Real Misuse Threat of LLMs Authors: Yu Yan, Sheng Sun, Zhe Wang, Yijun Lin, Zenghao Duan, zhifei zheng, Min Liu, Zhiyi yin, Jianping Zhang | Published: 2025-08-22 | Updated: 2025-09-15 Privacy Assessment倫理基準遵守Large Language Model 2025.08.22 2025.09.17 Literature Database
Rethinking Exact Unlearning under Exposure: Extracting Forgotten Data under Exact Unlearning in Large Language Model Authors: Xiaoyu Wu, Yifei Pang, Terrance Liu, Zhiwei Steven Wu | Published: 2025-05-30 | Updated: 2025-10-06 Privacy-Preserving Machine LearningPrivacy Loss Analysis倫理基準遵守 2025.05.30 2025.10.08 Literature Database
Adversarial Suffix Filtering: a Defense Pipeline for LLMs Authors: David Khachaturov, Robert Mullins | Published: 2025-05-14 Prompt validation倫理基準遵守Attack Detection Method 2025.05.14 2025.05.28 Literature Database