Safeguard-by-Development: A Privacy-Enhanced Development Paradigm for Multi-Agent Collaboration Systems Authors: Jian Cui, Zichuan Li, Luyi Xing, Xiaojing Liao | Published: 2025-05-07 | Updated: 2025-06-24 2025.05.07 文献データベース
OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models Authors: Xiaoyu Xu, Minxin Du, Qingqing Ye, Haibo Hu | Published: 2025-05-07 2025.05.07 文献データベース
敵対的サンプルから守る、敵対的学習 敵対的サンプルに対する対策技術の1つに、敵対的学習があります。本記事では、敵対的学習を用いて、どのように敵対的サンプルの影響を防ぐかを解説します。 2025.05.07 専門家向け解説記事
Weaponizing Language Models for Cybersecurity Offensive Operations: Automating Vulnerability Assessment Report Validation; A Review Paper Authors: Abdulrahman S Almuhaidib, Azlan Mohd Zain, Zalmiyah Zakaria, Izyan Izzati Kamsani, Abdulaziz S Almuhaidib | Published: 2025-05-07 2025.05.07 文献データベース
AutoPatch: Multi-Agent Framework for Patching Real-World CVE Vulnerabilities Authors: Minjae Seo, Wonwoo Choi, Myoungsung You, Seungwon Shin | Published: 2025-05-07 2025.05.07 文献データベース
LLMs’ Suitability for Network Security: A Case Study of STRIDE Threat Modeling Authors: AbdulAziz AbdulGhaffar, Ashraf Matrawy | Published: 2025-05-07 2025.05.07 文献データベース
LlamaFirewall: An open source guardrail system for building secure AI agents Authors: Sahana Chennabasappa, Cyrus Nikolaidis, Daniel Song, David Molnar, Stephanie Ding, Shengye Wan, Spencer Whitman, Lauren Deason, Nicholas Doucette, Abraham Montilla, Alekhya Gampa, Beto de Paola, Dominik Gabi, James Crnkovich, Jean-Christophe Testud, Kat He, Rashnil Chaturvedi, Wu Zhou, Joshua Saxe | Published: 2025-05-06 2025.05.06 文献データベース
BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models Authors: Zihan Wang, Hongwei Li, Rui Zhang, Wenbo Jiang, Kangjie Chen, Tianwei Zhang, Qingchuan Zhao, Guowen Xu | Published: 2025-05-06 2025.05.06 文献データベース
Detecting Quishing Attacks with Machine Learning Techniques Through QR Code Analysis Authors: Fouad Trad, Ali Chehab | Published: 2025-05-06 2025.05.06 文献データベース
The Steganographic Potentials of Language Models Authors: Artem Karpov, Tinuade Adeleke, Seong Hah Cho, Natalia Perez-Campanero | Published: 2025-05-06 2025.05.06 文献データベース