Towards Identification and Intervention of Safety-Critical Parameters in Large Language Models Authors: Weiwei Qi, Zefeng Wu, Tianhang Zheng, Zikang Zhang, Xiaojun Jia, Zhan Qin, Kui Ren | Published: 2026-04-09 2026.04.09 文献データベース
The Art of (Mis)alignment: How Fine-Tuning Methods Effectively Misalign and Realign LLMs in Post-Training Authors: Rui Zhang, Hongwei Li, Yun Shen, Xinyue Shen, Wenbo Jiang, Guowen Xu, Yang Liu, Michael Backes, Yang Zhang | Published: 2026-04-09 2026.04.09 文献データベース
On the Price of Privacy for Language Identification and Generation Authors: Xiaoyu Li, Andi Han, Jiaojiao Jiang, Junbin Gao | Published: 2026-04-08 2026.04.08 文献データベース
TraceSafe: A Systematic Assessment of LLM Guardrails on Multi-Step Tool-Calling Trajectories Authors: Yen-Shan Chen, Sian-Yao Huang, Cheng-Lin Yang, Yun-Nung Chen | Published: 2026-04-08 2026.04.08 文献データベース
VulGD: A LLM-Powered Dynamic Open-Access Vulnerability Graph Database Authors: Luat Do, Jiao Yin, Jinli Cao, Hua Wang | Published: 2026-04-08 2026.04.08 文献データベース
Data Leakage in Automotive Perception: Practitioners’ Insights Authors: Md Abu Ahammed Babu, Sushant Kumar Pandey, Darko Durisic, Andras Balint, Miroslaw Staron | Published: 2026-04-08 2026.04.08 文献データベース
SentinelSphere: Integrating AI-Powered Real-Time Threat Detection with Cybersecurity Awareness Training Authors: Nikolaos D. Tantaroudas, Ilias Karachalios, Andrew J. McCracken | Published: 2026-04-08 2026.04.08 文献データベース
MirageBackdoor: A Stealthy Attack that Induces Think-Well-Answer-Wrong Reasoning Authors: Yizhe Zeng, Wei Zhang, Yunpeng Li, Juxin Xiao, Xiao Wang, Yuling Liu | Published: 2026-04-08 2026.04.08 文献データベース
Argus: Reorchestrating Static Analysis via a Multi-Agent Ensemble for Full-Chain Security Vulnerability Detection Authors: Zi Liang, Qipeng Xie, Jun He, Bohuan Xue, Weizheng Wang, Yuandao Cai, Fei Luo, Boxian Zhang, Haibo Hu, Kaishun Wu | Published: 2026-04-08 2026.04.08 文献データベース
PoC-Adapt: Semantic-Aware Automated Vulnerability Reproduction with LLM Multi-Agents and Reinforcement Learning-Driven Adaptive Policy Authors: Phan The Duy, Nguyen Viet Duy, Khoa Ngo-Khanh, Nguyen Huu Quyen, Van-Hau Pham | Published: 2026-04-08 2026.04.08 文献データベース