攻撃手法の効果

Dagger Behind Smile: Fool LLMs with a Happy Ending Story

Authors: Xurui Song, Zhixin Xie, Shuo Huai, Jiayi Kong, Jun Luo | Published: 2025-01-19 | Updated: 2025-09-30
Disabling Safety Mechanisms of LLM
Malicious Prompt
攻撃手法の効果