Decomposing and Editing Predictions by Modeling Model Computation Authors: Harshay Shah, Andrew Ilyas, Aleksander Madry | Published: 2024-04-17 WatermarkingModel InterpretabilityModel editing techniques 2024.04.17 2025.05.27 Literature Database
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models Authors: Xinwei Wu, Junzhuo Li, Minghui Xu, Weilong Dong, Shuangzhi Wu, Chao Bian, Deyi Xiong | Published: 2023-10-31 | Updated: 2023-12-05 Privacy Protection MethodPrivacy TechniqueModel editing techniques 2023.10.31 2025.05.28 Literature Database
Proof of Unlearning: Definitions and Instantiation Authors: Jiasi Weng, Shenglong Yao, Yuefeng Du, Junjie Huang, Jian Weng, Cong Wang | Published: 2022-10-20 | Updated: 2022-10-21 DNN IP Protection MethodPrivacy Risk ManagementModel editing techniques 2022.10.20 2025.05.28 Literature Database