Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
Authors: Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum Anderson, Yaron Singer, Amin Karbasi | Published: 2023-12-04 | Updated: 2024-10-31
Query Generation Method
Prompt Injection
Watermark Evaluation