Rényi Pufferfish Privacy: General Additive Noise Mechanisms and Privacy Amplification by Iteration Authors: Clément Pierquin, Aurélien Bellet, Marc Tommasi, Matthieu Boussard | Published: 2023-12-21 | Updated: 2024-06-10 WatermarkingPrivacy Protection MethodWatermark Evaluation 2023.12.21 2025.05.27 Literature Database
Rethinking Robustness of Model Attributions Authors: Sandesh Kamath, Sankalp Mittal, Amit Deshpande, Vineeth N Balasubramanian | Published: 2023-12-16 Robustness EvaluationWatermark RobustnessWatermark Evaluation 2023.12.16 2025.05.27 Literature Database
Silent Guardian: Protecting Text from Malicious Exploitation by Large Language Models Authors: Jiawei Zhao, Kejiang Chen, Xiaojian Yuan, Yuang Qi, Weiming Zhang, Nenghai Yu | Published: 2023-12-15 | Updated: 2024-10-10 Privacy Protection MethodPrompt InjectionWatermark Evaluation 2023.12.15 2025.05.27 Literature Database
Unsupervised and Supervised learning by Dense Associative Memory under replica symmetry breaking Authors: Linda Albanese, Andrea Alessandrelli, Alessia Annibale, Adriano Barra | Published: 2023-12-15 Convergence PropertyWatermark RobustnessWatermark Evaluation 2023.12.15 2025.05.27 Literature Database
Data-Free Hard-Label Robustness Stealing Attack Authors: Xiaojian Yuan, Kejiang Chen, Wen Huang, Jie Zhang, Weiming Zhang, Nenghai Yu | Published: 2023-12-10 | Updated: 2023-12-12 WatermarkingRobustness EvaluationWatermark Evaluation 2023.12.10 2025.05.28 Literature Database
Provable Adversarial Robustness for Group Equivariant Tasks: Graphs, Point Clouds, Molecules, and More Authors: Jan Schuchardt, Yan Scholten, Stephan Günnemann | Published: 2023-12-05 | Updated: 2024-01-15 Robustness EvaluationWatermark RobustnessWatermark Evaluation 2023.12.05 2025.05.28 Literature Database
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically Authors: Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum Anderson, Yaron Singer, Amin Karbasi | Published: 2023-12-04 | Updated: 2024-10-31 Query Generation MethodPrompt InjectionWatermark Evaluation 2023.12.04 2025.05.28 Literature Database
FRAUDability: Estimating Users’ Susceptibility to Financial Fraud Using Adversarial Machine Learning Authors: Chen Doytshman, Satoru Momiyama, Inderjeet Singh, Yuval Elovici, Asaf Shabtai | Published: 2023-12-02 WatermarkingFraudulent TransactionWatermark Evaluation 2023.12.02 2025.05.28 Literature Database
Deep Unlearning: Fast and Efficient Gradient-free Approach to Class Forgetting Authors: Sangamesh Kodge, Gobinda Saha, Kaushik Roy | Published: 2023-12-01 | Updated: 2024-08-05 WatermarkingMachine UnlearningWatermark Evaluation 2023.12.01 2025.05.28 Literature Database
Mark My Words: Analyzing and Evaluating Language Model Watermarks Authors: Julien Piet, Chawin Sitawarin, Vivian Fang, Norman Mu, David Wagner | Published: 2023-12-01 | Updated: 2024-10-11 Prompt InjectionWatermark RobustnessWatermark Evaluation 2023.12.01 2025.05.28 Literature Database