DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators Authors: Tejumade Afonja, Hui-Po Wang, Raouf Kerkouche, Mario Fritz | Published: 2024-12-03 | Updated: 2025-04-29 2024.12.03 文献データベース
Intermediate Outputs Are More Sensitive Than You Think Authors: Tao Huang, Qingyu Huang, Jiayang Meng | Published: 2024-12-01 2024.12.01 2025.04.03 文献データベース
VLSBench: Unveiling Visual Leakage in Multimodal Safety Authors: Xuhao Hu, Dongrui Liu, Hao Li, Xuanjing Huang, Jing Shao | Published: 2024-11-29 | Updated: 2025-01-17 2024.11.29 2025.04.03 文献データベース
LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states Authors: Luis Ibanez-Lissen, Lorena Gonzalez-Manzano, Jose Maria de Fuentes, Nicolas Anciaux, Joaquin Garcia-Alfaro | Published: 2024-11-29 | Updated: 2025-01-10 2024.11.29 2025.04.03 文献データベース
CantorNet: A Sandbox for Testing Geometrical and Topological Complexity Measures Authors: Michal Lewandowski, Hamid Eghbalzadeh, Bernhard A. Moser | Published: 2024-11-29 | Updated: 2025-01-28 2024.11.29 2025.04.03 文献データベース
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment Authors: Soumya Suvra Ghosal, Souradip Chakraborty, Vaibhav Singh, Tianrui Guan, Mengdi Wang, Ahmad Beirami, Furong Huang, Alvaro Velasquez, Dinesh Manocha, Amrit Singh Bedi | Published: 2024-11-27 | Updated: 2025-03-20 2024.11.27 2025.04.03 文献データベース
SoK: Decentralized AI (DeAI) Authors: Zhipeng Wang, Rui Sun, Elizabeth Lui, Vatsal Shah, Xihan Xiong, Jiahao Sun, Davide Crapis, William Knottenbelt | Published: 2024-11-26 | Updated: 2025-04-16 2024.11.26 文献データベース
CleanVul: Automatic Function-Level Vulnerability Detection in Code Commits Using LLM Heuristics Authors: Yikun Li, Ting Zhang, Ratnadira Widyasari, Yan Naing Tun, Huu Hung Nguyen, Tan Bui, Ivana Clairine Irsan, Yiran Cheng, Xiang Lan, Han Wei Ang, Frank Liauw, Martin Weyssow, Hong Jin Kang, Eng Lieh Ouh, Lwin Khin Shar, David Lo | Published: 2024-11-26 | Updated: 2025-01-16 2024.11.26 2025.04.03 文献データベース
CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity Authors: Zhengmin Yu, Jiutian Zeng, Siyi Chen, Wenhan Xu, Dandan Xu, Xiangyu Liu, Zonghao Ying, Nan Wang, Yuan Zhang, Min Yang | Published: 2024-11-25 | Updated: 2025-01-17 2024.11.25 2025.04.03 文献データベース
“Moralized” Multi-Step Jailbreak Prompts: Black-Box Testing of Guardrails in Large Language Models for Verbal Attacks Authors: Libo Wang | Published: 2024-11-23 | Updated: 2025-03-20 2024.11.23 2025.04.03 文献データベース