When LLMs Copy to Think: Uncovering Copy-Guided Attacks in Reasoning LLMs Authors: Yue Li, Xiao Li, Hao Wu, Yue Zhang, Fengyuan Xu, Xiuzhen Cheng, Sheng Zhong | Published: 2025-07-22 Prompt leakingModel DoSAttack Method 2025.07.22 2025.07.24 Literature Database
Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection Authors: Ziqi Miao, Yi Ding, Lijun Li, Jing Shao | Published: 2025-07-03 Prompt InjectionModel DoSAdversarial Attack Methods 2025.07.03 2025.07.05 Literature Database
Evaluating Language Models For Threat Detection in IoT Security Logs Authors: Jorge J. Tejero-Fernández, Alfonso Sánchez-Macián | Published: 2025-07-03 IoT Traffic AnalysisModel DoSVulnerability Analysis 2025.07.03 2025.07.05 Literature Database
SoK: Data Reconstruction Attacks Against Machine Learning Models: Definition, Metrics, and Benchmark Authors: Rui Wen, Yiyong Liu, Michael Backes, Yang Zhang | Published: 2025-06-09 Model DoS再構成アルゴリズム評価メトリクス 2025.06.09 2025.06.11 Literature Database
Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems Authors: Pengfei He, Zhenwei Dai, Xianfeng Tang, Yue Xing, Hui Liu, Jingying Zeng, Qiankun Peng, Shrivats Agrawal, Samarth Varshney, Suhang Wang, Jiliang Tang, Qi He | Published: 2025-06-03 Indirect Prompt InjectionModel DoSEthical Considerations 2025.06.03 2025.06.05 Literature Database
A Red Teaming Roadmap Towards System-Level Safety Authors: Zifan Wang, Christina Q. Knight, Jeremy Kritz, Willow E. Primack, Julian Michael | Published: 2025-05-30 | Updated: 2025-06-09 Model DoSLarge Language Model製品安全性 2025.05.30 2025.06.11 Literature Database
IRCopilot: Automated Incident Response with Large Language Models Authors: Xihuan Lin, Jie Zhang, Gelei Deng, Tianzhe Liu, Xiaolong Liu, Changcai Yang, Tianwei Zhang, Qing Guo, Riqing Chen | Published: 2025-05-27 LLM SecurityIndirect Prompt InjectionModel DoS 2025.05.27 2025.05.29 Literature Database
From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models Authors: Yidan Wang, Yubing Ren, Yanan Cao, Binxing Fang | Published: 2025-05-15 Model DoSDigital Watermarking for Generative AIWatermark Removal Technology 2025.05.15 2025.05.28 Literature Database
A Weighted Byzantine Fault Tolerance Consensus Driven Trusted Multiple Large Language Models Network Authors: Haoxiang Luo, Gang Sun, Yinqiu Liu, Dongcheng Zhao, Dusit Niyato, Hongfang Yu, Schahram Dustdar | Published: 2025-05-08 Byzantine Consensus MechanismModel DoSReliability Assessment 2025.05.08 2025.05.27 Literature Database
OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models Authors: Xiaoyu Xu, Minxin Du, Qingqing Ye, Haibo Hu | Published: 2025-05-07 Token Identification MethodModel DoSPerformance Evaluation 2025.05.07 2025.05.27 Literature Database