Literature Database

R-CoT: A Reasoning-Layer Watermark via Redundant Chain-of-Thought in Large Language Models

Authors: Ziming Zhang, Li Li, Guorui Feng, Hanzhou Wu, Xinpeng Zhang | Published: 2026-04-28
Prompt Injection
報酬関数設計
Verifiable Credentials

Making AI-Assisted Grant Evaluation Auditable without Exposing the Model

Authors: Kemal Bicakci | Published: 2026-04-28
リスクシナリオ生成
Verifiable Credentials
Evaluation Method

AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents

Authors: Yixiang Zhang, Xinhao Deng, Jiaqing Wu, Yue Xiao, Ke Xu, Qi Li | Published: 2026-04-27
Indirect Prompt Injection
リスクシナリオ生成
Attack Chain Analysis

Layerwise Convergence Fingerprints for Runtime Misbehavior Detection in Large Language Models

Authors: Nay Myat Min, Long H. Pham, Jun Sun | Published: 2026-04-27
Indirect Prompt Injection
Prompt Injection
Generalization Performance

GAMMAF: A Common Framework for Graph-Based Anomaly Monitoring Benchmarking in LLM Multi-Agent Systems

Authors: Pablo Mateo-Torrejón, Alfonso Sánchez-Macián | Published: 2026-04-27
LLM Performance Evaluation
Indirect Prompt Injection
マルチエージェントシステム

A Survey on Split Learning for LLM Fine-Tuning: Models, Systems, and Privacy Optimizations

Authors: Zihan Liu, Yizhen Wang, Rui Wang, Xiu Tang, Sai Wu | Published: 2026-04-27
Bias Detection in AI Output
Privacy Protection Method
Federated Learning

Defusing the Trigger: Plug-and-Play Defense for Backdoored LLMs via Tail-Risk Intrinsic Geometric Smoothing

Authors: Kaisheng Fan, Weizhe Zhang, Yishu Gao, Tegawendé F. Bissyandé, Xunzhu Tang | Published: 2026-04-27
Backdoor Detection
Model Extraction Attack
Attack Chain Analysis

AgentVisor: Defending LLM Agents Against Prompt Injection via Semantic Virtualization

Authors: Zonghao Ying, Haozheng Wang, Jiangfan Liu, Quanchen Zou, Aishan Liu, Jian Yang, Yaodong Yang, Xianglong Liu | Published: 2026-04-27
LLM Performance Evaluation
Indirect Prompt Injection
Data Protection Method

An Information-Geometric Framework for Stability Analysis of Large Language Models under Entropic Stress

Authors: Hikmat Karimov, Rahid Zahid Alekberli | Published: 2026-04-27
Generalization Performance
Interpretability
Evaluation Method

System-aware contextual digital twin for ICS anomaly diagnosis

Authors: Eungyu Woo, Yooshin Kim, Wonje Heo, Donghoon Shin | Published: 2026-04-27
Class Imbalance
Interpretability