Toward Securing AI Agents Like Operating Systems Authors: Lukas Pirch, Micha Horlboge, Patrick Großmann, Syeda Mahnur Asif, Klim Kireev, Thorsten Holz, Konrad Rieck | Published: 2026-05-14 2026.05.14 文献データベース
EVA: Editing for Versatile Alignment against Jailbreaks Authors: Yi Wang, Hongye Qiu, Yue Xu, Sibei Yang, Zhan Qin, Minlie Huang, Wenjie Wang | Published: 2026-05-14 2026.05.14 文献データベース
Defenses at Odds: Measuring and Explaining Defense Conflicts in Large Language Models Authors: Xiangtao Meng, Wenyu Chen, Chuanchao Zang, Xinyu Gao, Jianing Wang, Li Wang, Zheng Li, Shanqing Guo | Published: 2026-05-14 2026.05.14 文献データベース
Exploiting LLM Agent Supply Chains via Payload-less Skills Authors: Xinyu Liu, Yukai Zhao, Xing Hu, Xin Xia | Published: 2026-05-14 2026.05.14 文献データベース
Watermarking Game-Playing Agents in Perfect-Information Extensive-Form Games Authors: Juho Kim, Fei Fang, Tuomas Sandholm | Published: 2026-05-14 2026.05.14 文献データベース
Identifying AI Web Scrapers Using Canary Tokens Authors: Steven Seiden, Triss Ren, Caroline Zhang, Taein Kim, Enze Liu, Emily Wenger | Published: 2026-05-13 2026.05.13 文献データベース
AIエージェントによる悪用に関する脅威 はじめにAI技術の発展に伴い、人間に変わって特定のタスクを自律的に行うAIエージェントを用いたシステム(AIエージェントシステム)の利活用が期待されています。大規模言語モデル(Large Language Model, LLM)を中核に、C... 2026.05.13 専門家向け解説記事
Model-Agnostic Lifelong LLM Safety via Externalized Attack-Defense Co-Evolution Authors: Xiaozhe Zhang, Chaozhuo Li, Hui Liu, Shaocheng Yan, Bingyu Yan, Qiwei Ye, Haoliang Li | Published: 2026-05-13 2026.05.13 文献データベース
Empowering IoT Security: On-Device Intrusion Detection in Resource Constrained Devices Authors: Vasilis Ieropoulos, Eirini Anthi, Theodoros Spyridopoulos, Pete Burnap, Aftab Khan, Pietro Carnelli | Published: 2026-05-13 2026.05.13 文献データベース
Quantifying LLM Safety Degradation Under Repeated Attacks Using Survival Analysis Authors: Zvi Topol | Published: 2026-05-13 2026.05.13 文献データベース