Literature Database

Agentic Cloud Decoys: A Deception-Driven Framework for Autonomous Intrusion Investigation

Authors: Mohan Manivannan, Dalal Alharthi | Published: 2026-07-27

Indirect Prompt Injection

Report Generation

Evaluation Results

2026.07.27

Tag Questions and the Generational Reversal of Sycophancy Across 45 Language Models

Authors: Tapan Parikh | Published: 2026-07-27

RAG

Model Communication

User Behavior Analysis

2026.07.27

Understanding Machine Unlearning Through the Lens of Mode Connectivity

Authors: Jiali Cheng, Hadi Amiri | Published: 2026-07-27

Dataset Evaluation

Model Protection Methods

Machine Learning

2026.07.27

V-DEAL: Diagnosing Video Safety De-Calibration as an Understanding-Refusal Coupling Failure

Authors: Zhetong Zhang, Honghao Fu, Miao Xu, Yiwei Wang, Yujun Cai | Published: 2026-07-23

User Behavior Analysis

Risk Assessment

Effectiveness of Attack Methods

2026.07.23

TOUR: A Trajectory-Level Unlearning Benchmark for Offline Reinforcement Learning

Authors: Chaofan Pan, Lingfei Ren, Xiangyu Jiang, Yanhua Li, Xuemei Cao, Xiangkun Wang, Hao Yu, Wei Wei, Xin Yang | Published: 2026-07-23

Dataset Evaluation

Effectiveness of Attack Methods

Literature Review

2026.07.23

GuardianAgentBench: Where Agents Fail and How to Guard Them

Authors: Vishal Ishwar Naik, Chenyu Xu, Donna Dong, Hussein Hassan, Abhishek Pradhan, Ofer Mendelevitch, Tallat Shafat, Humayun Irshad | Published: 2026-07-23

Indirect Prompt Injection

Task Design

Prompt Injection

2026.07.23

Is Deep Research Reliable? Misleading Knowledge Induces False Conclusions

Authors: Pengyu Zhu, Lijun Li, Longju Yang, Sen Su | Published: 2026-07-23

Task Design

Dataset Issues

Misinformation Detection

2026.07.23

Beyond Heavy Log Curation: Perplexity-Based APT Detection via Unsupervised, Context-Augmented Language Models

Authors: Shoya Otsu, Kei Suzuki, Toshiaki Koike-Akino, Jing Liu, Ye Wang | Published: 2026-07-23

Dataset Evaluation

Model Communication

Machine Learning

2026.07.23

Generative AI floods and dilutes the market for books

Authors: Tuhin Chakrabarty, Xinyue Liu, Jane C. Ginsburg, Paramveer Dhillon | Published: 2026-07-22

Dataset Issues

User Behavior Analysis

Statistical Analysis

2026.07.22

The Ethics of Autonomous AI Agents for Offensive Security

Authors: Andreas Happe, Jürgen Cito, Jasmin Wachter | Published: 2026-07-22

Indirect Prompt Injection

Ethical Standards Compliance

Accountability Attribution System Design

2026.07.22
