敵対的攻撃評価

PR-Attack: Coordinated Prompt-RAG Attacks on Retrieval-Augmented Generation in Large Language Models via Bilevel Optimization

Authors: Yang Jiao, Xiaodong Wang, Kai Yang | Published: 2025-04-10
LLM性能評価
RAGへのポイズニング攻撃
敵対的攻撃評価

Houdini: Fooling Deep Structured Prediction Models

Authors: Moustapha Cisse, Yossi Adi, Natalia Neverova, Joseph Keshet | Published: 2017-07-17
モデルの頑健性保証
敵対的攻撃評価
音声認識技術