On existence, uniqueness and scalability of adversarial robustness measures for AI classifiers Authors: Illia Horenko | Published: 2023-10-19 | Updated: 2023-11-15 2023.10.19 2025.04.03 Literature Database
Privacy Preserving Large Language Models: ChatGPT Case Study Based Vision and Framework Authors: Imdad Ullah, Najm Hassan, Sukhpal Singh Gill, Basem Suleiman, Tariq Ahamed Ahanger, Zawar Shah, Junaid Qadir, Salil S. Kanhere | Published: 2023-10-19
Attack Prompt Generation for Red Teaming and Defending Large Language Models Authors: Boyi Deng, Wenjie Wang, Fuli Feng, Yang Deng, Qifan Wang, Xiangnan He | Published: 2023-10-19
REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models Authors: Ruisi Zhang, Shehzeen Samarah Hussain, Paarth Neekhara, Farinaz Koushanfar | Published: 2023-10-18 | Updated: 2024-04-08
Large Language Models for Code Analysis: Do LLMs Really Do Their Job? Authors: Chongzhou Fang, Ning Miao, Shaurya Srivastav, Jialin Liu, Ruoyu Zhang, Ruijie Fang, Asmita, Ryan Tsang, Najmeh Nazari, Han Wang, Houman Homayoun | Published: 2023-10-18 | Updated: 2024-03-05
A Cautionary Tale: On the Role of Reference Data in Empirical Privacy Defenses Authors: Caelin G. Kaplan, Chuan Xu, Othmane Marfoq, Giovanni Neglia, Anderson Santana de Oliveira | Published: 2023-10-18
A General Theoretical Paradigm to Understand Learning from Human Preferences Authors: Mohammad Gheshlaghi Azar, Mark Rowland, Bilal Piot, Daniel Guo, Daniele Calandriello, Michal Valko, Rémi Munos | Published: 2023-10-18 | Updated: 2023-11-22
MalDICT: Benchmark Datasets on Malware Behaviors, Platforms, Exploitation, and Packers Authors: Robert J. Joyce, Edward Raff, Charles Nicholas, James Holt | Published: 2023-10-18
IoTGeM: Generalizable Models for Behaviour-Based IoT Attack Detection Authors: Kahraman Kostas, Mike Just, Michael A. Lones | Published: 2023-10-17
The Efficacy of Transformer-based Adversarial Attacks in Security Domains Authors: Kunyang Li, Kyle Domico, Jean-Charles Noirot Ferrand, Patrick McDaniel | Published: 2023-10-17