Unbiased Watermark for Large Language Models Authors: Zhengmian Hu, Lichang Chen, Xidong Wu, Yihan Wu, Hongyang Zhang, Heng Huang | Published: 2023-09-22 | Updated: 2023-10-18 2023.09.22 2025.05.28 Literature Database
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A” Authors: Lukas Berglund, Meg Tong, Max Kaufmann, Mikita Balesni, Asa Cooper Stickland, Tomasz Korbak, Owain Evans | Published: 2023-09-21 | Updated: 2024-05-26 2023.09.21 2025.05.28 Literature Database
Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation Authors: Xinyu Tang, Richard Shin, Huseyin A. Inan, Andre Manoel, Fatemehsadat Mireshghallah, Zinan Lin, Sivakanth Gopi, Janardhan Kulkarni, Robert Sim | Published: 2023-09-21 | Updated: 2024-01-28 2023.09.21 2025.05.28 Literature Database
How Robust is Google’s Bard to Adversarial Image Attacks? Authors: Yinpeng Dong, Huanran Chen, Jiawei Chen, Zhengwei Fang, Xiao Yang, Yichi Zhang, Yu Tian, Hang Su, Jun Zhu | Published: 2023-09-21 | Updated: 2023-10-14 2023.09.21 2025.05.28 Literature Database
“It’s a Fair Game”, or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents Authors: Zhiping Zhang, Michelle Jia, Hao-Ping Lee, Bingsheng Yao, Sauvik Das, Ada Lerner, Dakuo Wang, Tianshi Li | Published: 2023-09-20 | Updated: 2024-04-02 2023.09.20 2025.05.28 Literature Database
FRAMU: Attention-based Machine Unlearning using Federated Reinforcement Learning Authors: Thanveer Shaik, Xiaohui Tao, Lin Li, Haoran Xie, Taotao Cai, Xiaofeng Zhu, Qing Li | Published: 2023-09-19 | Updated: 2024-02-02 2023.09.19 2025.05.28 Literature Database
Efficient Avoidance of Vulnerabilities in Auto-completed Smart Contract Code Using Vulnerability-constrained Decoding Authors: André Storhaug, Jingyue Li, Tianyuan Hu | Published: 2023-09-18 | Updated: 2023-10-06 2023.09.18 2025.05.28 Literature Database
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM Authors: Bochuan Cao, Yuanpu Cao, Lu Lin, Jinghui Chen | Published: 2023-09-18 | Updated: 2024-06-12 2023.09.18 2025.05.28 Literature Database
A Duty to Forget, a Right to be Assured? Exposing Vulnerabilities in Machine Unlearning Services Authors: Hongsheng Hu, Shuo Wang, Jiamin Chang, Haonan Zhong, Ruoxi Sun, Shuang Hao, Haojin Zhu, Minhui Xue | Published: 2023-09-15 | Updated: 2024-01-15 2023.09.15 2025.05.28 Literature Database
Multi-Source Domain Adaptation meets Dataset Distillation through Dataset Dictionary Learning Authors: Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac | Published: 2023-09-14 2023.09.14 2025.05.28 Literature Database