On the Detectability of ChatGPT Content: Benchmarking, Methodology, and Evaluation through the Lens of Academic Writing | Authors: Zeyan Liu, Zijun Yao, Fengjun Li, Bo Luo | Published: 2023-06-07 | Updated: 2024-03-18 | Tags: LLM Application, Prompt Injection, Literature List | 2023.06.07 2025.05.28 Literature Database
On Evaluating Adversarial Robustness of Large Vision-Language Models | Authors: Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-Man Cheung, Min Lin | Published: 2023-05-26 | Updated: 2023-10-29 | Tags: LLM Performance Evaluation, Prompt Injection, Adversarial attack
Spear Phishing With Large Language Models | Authors: Julian Hazell | Published: 2023-05-11 | Updated: 2023-12-22 | Tags: Cyber Attack, Phishing Attack, Prompt Injection
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT | Authors: Xinyue Shen, Zeyuan Chen, Michael Backes, Yang Zhang | Published: 2023-04-18 | Updated: 2023-10-05 | Tags: LLM Security, Prompt Injection, User Experience Evaluation
Multi-step Jailbreaking Privacy Attacks on ChatGPT | Authors: Haoran Li, Dadi Guo, Wei Fan, Mingshi Xu, Jie Huang, Fanpu Meng, Yangqiu Song | Published: 2023-04-11 | Updated: 2023-11-01 | Tags: LLM Security, Privacy Analysis, Prompt Injection
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable Confidence | Authors: Hanbin Hong, Xinyu Zhang, Binghui Wang, Zhongjie Ba, Yuan Hong | Published: 2023-04-10 | Updated: 2024-09-06 | Tags: Prompt Injection, Experimental Validation, Attack Evaluation
DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection | Authors: Yizheng Chen, Zhoujie Ding, Lamya Alowain, Xinyun Chen, David Wagner | Published: 2023-04-01 | Updated: 2023-08-09 | Tags: Security label, Prompt Injection, Vulnerability detection
MGTBench: Benchmarking Machine-Generated Text Detection | Authors: Xinlei He, Xinyue Shen, Zeyuan Chen, Michael Backes, Yang Zhang | Published: 2023-03-26 | Updated: 2024-01-16 | Tags: MGT Detection Method, Prompt Injection, Performance Evaluation
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense | Authors: Kalpesh Krishna, Yixiao Song, Marzena Karpinska, John Wieting, Mohit Iyyer | Published: 2023-03-23 | Updated: 2023-10-18 | Tags: DNN IP Protection Method, Prompt Injection, Machine Learning Technology
Not what you’ve signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection | Authors: Kai Greshake, Sahar Abdelnabi, Shailesh Mishra, Christoph Endres, Thorsten Holz, Mario Fritz | Published: 2023-02-23 | Updated: 2023-05-05 | Tags: Indirect Prompt Injection, Prompt Injection, Malicious Prompt