攻撃成功率

Exposing the Systematic Vulnerability of Open-Weight Models to Prefill Attacks

Authors: Lukas Struppek, Adam Gleave, Kellin Pelrine | Published: 2026-02-16
Prompt Injection
Human Rights and Technology
攻撃成功率