Adaptive Evaluation of Out-of-Band Defenses Against Prompt Injection in LLM Agents

Authors: Praneeth Narisetty, Shiva Nagendra Babu Kore, Uday Kumar Reddy Kattamanchi, Jayaram Kumarapu | Published: 2026-06-25

Detect, Unlearn, Restore: Defending Text Summarization Models Against Data Poisoning

Authors: Poojitha Thota, Shirin Nilizadeh | Published: 2026-06-24

Privacy Vulnerabilities of Attention Layers in Tabular Foundation Models and Protection of High-Risk Queries

Authors: Tânia Carvalho, Maxime Cordy | Published: 2026-06-24

Color Matters: Trigger Color Affects Success in Federated Backdoor Attacks

Authors: Kavindu Herath, Joshua C. Zhao, Saurabh Bagchi | Published: 2026-06-24

Can Machine Learning Break Wi-Fi Privacy? A Study on MAC Address Randomization

Authors: Marta Puig, Costas Michaelides, Lucia Pintor, Boris Bellalta, Francesc Wilhelmi | Published: 2026-06-24

Do Encoders Suffice? A Systematic Comparison of Encoder and Decoder Safety Judges for LLM Adversarial Evaluation

Authors: Han Jeon, Shiv Medler, Joseph Voyles, Matt Wood | Published: 2026-06-24

RAS: Measuring LLM Safety Through Refusal Alignment

Authors: Chang-Chieh Huang, Yan-Lun Chen, Chia-Mu Yu, Wei-Bin Lee | Published: 2026-06-24

Threats to AI Agent Systems as a Whole

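Introduction: As AI technology advances, systems built on AI agents that autonomously carry out specific tasks on behalf of humans (AI agent systems) are expected to see growing use. With a large language model (LLM) at the core, Ch...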

Taxonomy of Risks on Automated Fact-Checking Systems Considering its Propagation

Authors: Jun Yajima, Tatsuya Oka, Takao Okubo | Published: 2026-06-24

An Approach for a Supporting Multi-LLM System for Automated Certification Based on the German IT-Grundschutz

Authors: Lea Roxanne Muth, Marian Margraf | Published: 2026-06-24