Attack Prompt Generation for Red Teaming and Defending Large Language Models Authors: Boyi Deng, Wenjie Wang, Fuli Feng, Yang Deng, Qifan Wang, Xiangnan He | Published: 2023-10-19 2023.10.19 2025.04.03 Literature Database
REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models Authors: Ruisi Zhang, Shehzeen Samarah Hussain, Paarth Neekhara, Farinaz Koushanfar | Published: 2023-10-18 | Updated: 2024-04-08
Large Language Models for Code Analysis: Do LLMs Really Do Their Job? Authors: Chongzhou Fang, Ning Miao, Shaurya Srivastav, Jialin Liu, Ruoyu Zhang, Ruijie Fang, Asmita, Ryan Tsang, Najmeh Nazari, Han Wang, Houman Homayoun | Published: 2023-10-18 | Updated: 2024-03-05
A Cautionary Tale: On the Role of Reference Data in Empirical Privacy Defenses Authors: Caelin G. Kaplan, Chuan Xu, Othmane Marfoq, Giovanni Neglia, Anderson Santana de Oliveira | Published: 2023-10-18
A General Theoretical Paradigm to Understand Learning from Human Preferences Authors: Mohammad Gheshlaghi Azar, Mark Rowland, Bilal Piot, Daniel Guo, Daniele Calandriello, Michal Valko, Rémi Munos | Published: 2023-10-18 | Updated: 2023-11-22
MalDICT: Benchmark Datasets on Malware Behaviors, Platforms, Exploitation, and Packers Authors: Robert J. Joyce, Edward Raff, Charles Nicholas, James Holt | Published: 2023-10-18
IoTGeM: Generalizable Models for Behaviour-Based IoT Attack Detection Authors: Kahraman Kostas, Mike Just, Michael A. Lones | Published: 2023-10-17
The Efficacy of Transformer-based Adversarial Attacks in Security Domains Authors: Kunyang Li, Kyle Domico, Jean-Charles Noirot Ferrand, Patrick McDaniel | Published: 2023-10-17
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Authors: Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, Hannaneh Hajishirzi | Published: 2023-10-17
Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning Authors: Rui Wen, Tianhao Wang, Michael Backes, Yang Zhang, Ahmed Salem | Published: 2023-10-17