Literature Database

Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts

Authors: Fatih Uenal | Published: 2026-04-07
LLM Performance Evaluation
Framework
Model Performance Evaluation

ClawLess: A Security Model of AI Agents

Authors: Hongyi Lu, Nian Liu, Shuai Wang, Fengwei Zhang | Published: 2026-04-07
Secure Aggregation
Framework
動的ポリシー適応

Hackers or Hallucinators? A Comprehensive Analysis of LLM-Based Automated Penetration Testing

Authors: Jiaren Peng, Zeqin Li, Chang You, Yan Wang, Hanlin Sun, Xuan Tian, Shuqiao Zhang, Junyi Liu, Jianguo Zhao, Renyang Liu, Haoran Ou, Yuqiang Sun, Jiancheng Zhang, Yutong Jiao, Kunshu Song, Chao Zhang, Fan Shi, Hongda Sun, Rui Yan, Cheng Huang | Published: 2026-04-07
RAG
Poisoning attack on RAG
Framework

Foundations for Agentic AI Investigations from the Forensic Analysis of OpenClaw

Authors: Jan Gruber, Jan-Niclas Hilgert | Published: 2026-04-07
Relationship of AI Systems
Data Collection
行動分析手法

Towards the Development of an LLM-Based Methodology for Automated Security Profiling in Compliance with Ukrainian Cybersecurity Regulations

Authors: Daniil Shafranskyi, Iryna Stopochkina, Mykola Ilin | Published: 2026-04-07
LLM Performance Evaluation
RAG
セキュリティプロファイリング

AttnDiff: Attention-based Differential Fingerprinting for Large Language Models

Authors: Haobo Zhang, Zhenhua Xu, Junxian Li, Shangfeng Sheng, Dezhang Kong, Meng Han | Published: 2026-04-07
LLM Performance Evaluation
Certified Robustness
Model Identification

MA-IDS: Multi-Agent RAG Framework for IoT Network Intrusion Detection with an Experience Library

Authors: Md Shamimul Islam, Luis G. Jaimes, Ayesha S. Dina | Published: 2026-04-07
IoT Security Framework
RAG
Poisoning attack on RAG

Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use

Authors: Wuyang Zhang, Shichao Pei | Published: 2026-04-07
RAG
Data Leakage
攻撃手法評価

Attribution-Driven Explainable Intrusion Detection with Encoder-Based Large Language Models

Authors: Umesh Biswas, Shafqat Hasan, Syed Mohammed Farhan, Nisha Pillai, Charan Gudla | Published: 2026-04-07
LLM Performance Evaluation
Dataset Generation
Interpretation Method

RuleForge: Automated Generation and Validation for Web Vulnerability Detection at Scale

Authors: Ayush Garg, Sophia Hager, Jacob Montiel, Aditya Tiwari, Michael Gentile, Zach Reavis, David Magnotti, Wayne Fullen | Published: 2026-04-02
LLM Performance Evaluation
脆弱性優先順位付け
自動生成フレームワーク