ProxyGPT: Enabling User Anonymity in LLM Chatbots via (Un)Trustworthy Volunteer Proxies | Authors: Dzung Pham, Jade Sheffey, Chau Minh Pham, Amir Houmansadr | Published: 2024-07-11 | Updated: 2025-06-11 | Tags: Privacy Enhancing Technology, Prompt Injection, Prompt leaking | 2024.07.11 2025.06.13 | Literature Database
Human-Imperceptible Retrieval Poisoning Attacks in LLM-Powered Applications | Authors: Quan Zhang, Binqi Zeng, Chijin Zhou, Gwihwan Go, Heyuan Shi, Yu Jiang | Published: 2024-04-26 | Tags: Poisoning attack on RAG, Prompt leaking, Poisoning | 2024.04.26 2025.05.27
Stealing Part of a Production Language Model | Authors: Nicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Itay Yona, Eric Wallace, David Rolnick, Florian Tramèr | Published: 2024-03-11 | Updated: 2024-07-09 | Tags: Prompt leaking, Model Robustness, Model Extraction Attack | 2024.03.11 2025.05.27
Secret Collusion among Generative AI Agents: Multi-Agent Deception via Steganography | Authors: Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip H. S. Torr, Lewis Hammond, Christian Schroeder de Witt | Published: 2024-02-12 | Updated: 2025-04-14 | Tags: Privacy Enhancing Technology, Prompt leaking, Digital Watermarking for Generative AI | 2024.02.12 2025.05.27
Language Model Inversion | Authors: John X. Morris, Wenting Zhao, Justin T. Chiu, Vitaly Shmatikov, Alexander M. Rush | Published: 2023-11-22 | Tags: Prompt leaking, Model Inversion, Model Evaluation | 2023.11.22 2025.05.28
Assessing Prompt Injection Risks in 200+ Custom GPTs | Authors: Jiahao Yu, Yuhang Wu, Dong Shu, Mingyu Jin, Sabrina Yang, Xinyu Xing | Published: 2023-11-20 | Updated: 2024-05-25 | Tags: Prompt Injection, Prompt leaking, Dialogue System | 2023.11.20 2025.05.28
You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content | Authors: Xinlei He, Savvas Zannettou, Yun Shen, Yang Zhang | Published: 2023-08-10 | Tags: Text Detoxification, Prompt leaking, Calculation of Output Harmfulness | 2023.08.10 2025.05.28
Effective Prompt Extraction from Language Models | Authors: Yiming Zhang, Nicholas Carlini, Daphne Ippolito | Published: 2023-07-13 | Updated: 2024-08-07 | Tags: Prompt Injection, Prompt leaking, Dialogue System | 2023.07.13 2025.05.28
Undetectable Watermarks for Language Models | Authors: Miranda Christ, Sam Gunn, Or Zamir | Published: 2023-05-25 | Tags: Prompt leaking, Digital Watermarking for Generative AI, Watermarking Technology | 2023.05.25 2025.05.28
Killing four birds with one Gaussian process: the relation between different test-time attacks | Authors: Kathrin Grosse, Michael T. Smith, Michael Backes | Published: 2018-06-06 | Updated: 2020-11-29 | Tags: Prompt leaking, Membership Inference, Watermark Evaluation | 2018.06.06 2025.05.28