Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy | Authors: Haoqi Wu, Wei Dai, Li Wang, Qiang Yan | Published: 2025-05-09 | Updated: 2025-05-15 | Tags: Token identification methods, Privacy design principles, Evaluation methods | 2025.05.09 | Literature Database
Defending against Indirect Prompt Injection by Instruction Detection | Authors: Tongyu Wen, Chenglong Wang, Xiyuan Yang, Haoyu Tang, Yueqi Xie, Lingjuan Lyu, Zhicheng Dou, Fangzhao Wu | Published: 2025-05-08 | Updated: 2025-09-17 | Tags: Prompt validation, Evaluation methods, Watermarking techniques | 2025.05.08 | Literature Database
Towards a standardized methodology and dataset for evaluating LLM-based digital forensic timeline analysis | Authors: Hudan Studiawan, Frank Breitinger, Mark Scanlon | Published: 2025-05-06 | Tags: LLM performance evaluation, Large language models, Evaluation methods | 2025.05.06 | Literature Database
GuidedBench: Measuring and Mitigating the Evaluation Discrepancies of In-the-wild LLM Jailbreak Methods | Authors: Ruixuan Huang, Xunguang Wang, Zongjie Li, Daoyuan Wu, Shuai Wang | Published: 2025-02-24 | Updated: 2025-07-09 | Tags: Prompt injection, Jailbreak methods, Evaluation methods | 2025.02.24 | Literature Database
Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs | Authors: Samuele Pasini, Jinhan Kim, Tommaso Aiello, Rocio Cabrera Lozoya, Antonino Sabetta, Paolo Tonella | Published: 2024-11-27 | Updated: 2025-09-17 | Tags: RAG, Poisoning attacks on RAG, Evaluation methods | 2024.11.27 | Literature Database
Variational Bayesian Bow tie Neural Networks with Shrinkage | Authors: Alisa Sheinkman, Sara Wade | Published: 2024-11-17 | Updated: 2024-11-19 | Tags: Sparse models, Optimization problems, Evaluation methods | 2024.11.17 2025.04.03 | Literature Database
FEDLAD: Federated Evaluation of Deep Leakage Attacks and Defenses | Authors: Isaac Baglin, Xiatian Zhu, Simon Hadfield | Published: 2024-11-05 | Updated: 2025-01-05 | Tags: Evaluation of poisoning attacks, Evaluation methods | 2024.11.05 2025.04.03 | Literature Database
Can LLMs be Scammed? A Baseline Measurement Study | Authors: Udari Madhushani Sehwag, Kelly Patel, Francesca Mosca, Vineeth Ravi, Jessica Staddon | Published: 2024-10-14 | Tags: LLM performance evaluation, Prompt injection, Evaluation methods | 2024.10.14 2025.04.03 | Literature Database
FedCert: Federated Accuracy Certification | Authors: Minh Hieu Nguyen, Huu Tien Nguyen, Trung Thanh Nguyen, Manh Duong Nguyen, Trong Nghia Hoang, Truong Thao Nguyen, Phi Le Nguyen | Published: 2024-10-04 | Tags: Evaluation methods | 2024.10.04 2025.04.03 | Literature Database
A novel application of Shapley values for large multidimensional time-series data: Applying explainable AI to a DNA profile classification neural network | Authors: Lauren Elborough, Duncan Taylor, Melissa Humphries | Published: 2024-09-26 | Tags: Algorithms, Watermarking, Evaluation methods | 2024.09.26 2025.04.03 | Literature Database