Adaptive Evaluation of Out-of-Band Defenses Against Prompt Injection in LLM Agents Authors: Praneeth Narisetty, Shiva Nagendra Babu Kore, Uday Kumar Reddy Kattamanchi, Jayaram Kumarapu | Published: 2026-06-25 2026.06.25 文献データベース
Detect, Unlearn, Restore: Defending Text Summarization Models Against Data Poisoning Authors: Poojitha Thota, Shirin Nilizadeh | Published: 2026-06-24 2026.06.24 文献データベース
Privacy Vulnerabilities of Attention Layers in Tabular Foundation Models and Protection of High-Risk Queries Authors: Tânia Carvalho, Maxime Cordy | Published: 2026-06-24 2026.06.24 文献データベース
Color Matters: Trigger Color Affects Success in Federated Backdoor Attacks Authors: Kavindu Herath, Joshua C. Zhao, Saurabh Bagchi | Published: 2026-06-24 2026.06.24 文献データベース
Can Machine Learning Break Wi-Fi Privacy? A Study on MAC Address Randomization Authors: Marta Puig, Costas Michaelides, Lucia Pintor, Boris Bellalta, Francesc Wilhelmi | Published: 2026-06-24 2026.06.24 文献データベース
Do Encoders Suffice? A Systematic Comparison of Encoder and Decoder Safety Judges for LLM Adversarial Evaluation Authors: Han Jeon, Shiv Medler, Joseph Voyles, Matt Wood | Published: 2026-06-24 2026.06.24 文献データベース
RAS: Measuring LLM Safety Through Refusal Alignment Authors: Chang-Chieh Huang, Yan-Lun Chen, Chia-Mu Yu, Wei-Bin Lee | Published: 2026-06-24 2026.06.24 文献データベース
AIエージェントシステム全体に関する脅威New はじめにAI技術の発展に伴い、人間に代わって特定のタスクを自律的に行うAIエージェントを用いたシステム(AIエージェントシステム)の利活用が期待されています。大規模言語モデル(Large Language Model、LLM)を中核に、Ch... 2026.06.24 専門家向け解説記事
Taxonomy of Risks on Automated Fact-Checking Systems Considering its Propagation Authors: Jun Yajima, Tatsuya Oka, Takao Okubo | Published: 2026-06-24 2026.06.24 文献データベース
An Approach for a Supporting Multi-LLM System for Automated Certification Based on the German IT-Grundschutz Authors: Lea Roxanne Muth, Marian Margraf | Published: 2026-06-24 2026.06.24 文献データベース