文献データベース

Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts

Authors: Fatih Uenal | Published: 2026-04-07
LLM性能評価
フレームワーク
モデル性能評価

ClawLess: A Security Model of AI Agents

Authors: Hongyi Lu, Nian Liu, Shuai Wang, Fengwei Zhang | Published: 2026-04-07
セキュアアグリゲーション
フレームワーク
動的ポリシー適応

Hackers or Hallucinators? A Comprehensive Analysis of LLM-Based Automated Penetration Testing

Authors: Jiaren Peng, Zeqin Li, Chang You, Yan Wang, Hanlin Sun, Xuan Tian, Shuqiao Zhang, Junyi Liu, Jianguo Zhao, Renyang Liu, Haoran Ou, Yuqiang Sun, Jiancheng Zhang, Yutong Jiao, Kunshu Song, Chao Zhang, Fan Shi, Hongda Sun, Rui Yan, Cheng Huang | Published: 2026-04-07
RAG
RAGへのポイズニング攻撃
フレームワーク

Foundations for Agentic AI Investigations from the Forensic Analysis of OpenClaw

Authors: Jan Gruber, Jan-Niclas Hilgert | Published: 2026-04-07
AIシステムの関係性
データ収集
行動分析手法

Towards the Development of an LLM-Based Methodology for Automated Security Profiling in Compliance with Ukrainian Cybersecurity Regulations

Authors: Daniil Shafranskyi, Iryna Stopochkina, Mykola Ilin | Published: 2026-04-07
LLM性能評価
RAG
セキュリティプロファイリング

AttnDiff: Attention-based Differential Fingerprinting for Large Language Models

Authors: Haobo Zhang, Zhenhua Xu, Junxian Li, Shangfeng Sheng, Dezhang Kong, Meng Han | Published: 2026-04-07
LLM性能評価
モデルの頑健性保証
モデル識別

MA-IDS: Multi-Agent RAG Framework for IoT Network Intrusion Detection with an Experience Library

Authors: Md Shamimul Islam, Luis G. Jaimes, Ayesha S. Dina | Published: 2026-04-07
IoTセキュリティフレームワーク
RAG
RAGへのポイズニング攻撃

Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use

Authors: Wuyang Zhang, Shichao Pei | Published: 2026-04-07
RAG
データ漏洩
攻撃手法評価

Attribution-Driven Explainable Intrusion Detection with Encoder-Based Large Language Models

Authors: Umesh Biswas, Shafqat Hasan, Syed Mohammed Farhan, Nisha Pillai, Charan Gudla | Published: 2026-04-07
LLM性能評価
データセット生成
解釈手法

RuleForge: Automated Generation and Validation for Web Vulnerability Detection at Scale

Authors: Ayush Garg, Sophia Hager, Jacob Montiel, Aditya Tiwari, Michael Gentile, Zach Reavis, David Magnotti, Wayne Fullen | Published: 2026-04-02
LLM性能評価
脆弱性優先順位付け
自動生成フレームワーク