Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM

Authors: Bochuan Cao, Yuanpu Cao, Lu Lin, Jinghui Chen | Published: 2023-09-18 | Updated: 2024-06-12

A Duty to Forget, a Right to be Assured? Exposing Vulnerabilities in Machine Unlearning Services

Authors: Hongsheng Hu, Shuo Wang, Jiamin Chang, Haonan Zhong, Ruoxi Sun, Shuang Hao, Haojin Zhu, Minhui Xue | Published: 2023-09-15 | Updated: 2024-01-15

Multi-Source Domain Adaptation meets Dataset Distillation through Dataset Dictionary Learning

Authors: Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac | Published: 2023-09-14

Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement

Authors: Chenghao Li, Dake Chen, Yuke Zhang, Peter A. Beerel | Published: 2023-09-13 | Updated: 2024-01-23

A Comprehensive Analysis of the Role of Artificial Intelligence and Machine Learning in Modern Digital Forensics and Incident Response

Authors: Dipo Dunsin, Mohamed C. Ghanem, Karim Ouazzane, Vassil Vassilev | Published: 2023-09-13 | Updated: 2023-12-03

Deep Nonparametric Convexified Filtering for Computational Photography, Image Synthesis and Adversarial Defense

Authors: Jianqiao Wangni | Published: 2023-09-13 | Updated: 2023-09-14

Recovering from Privacy-Preserving Masking with Large Language Models

Authors: Arpita Vats, Zhe Liu, Peng Su, Debjyoti Paul, Yingyi Ma, Yutong Pang, Zeeshan Ahmed, Ozlem Kalinli | Published: 2023-09-12 | Updated: 2023-12-14

SABLE: Secure And Byzantine robust LEarning

Authors: Antoine Choffrut, Rachid Guerraoui, Rafael Pinot, Renaud Sirdey, John Stephan, Martin Zuber | Published: 2023-09-11 | Updated: 2023-12-14

FuzzLLM: A Novel and Universal Fuzzing Framework for Proactively Discovering Jailbreak Vulnerabilities in Large Language Models

Authors: Dongyu Yao, Jianshu Zhang, Ian G. Harris, Marcel Carlsson | Published: 2023-09-11 | Updated: 2024-04-14

Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis

Authors: Li Du, Yequan Wang, Xingrun Xing, Yiqun Ya, Xiang Li, Xin Jiang, Xuezhi Fang | Published: 2023-09-11