Model editing techniques

Decomposing and Editing Predictions by Modeling Model Computation

Authors: Harshay Shah, Andrew Ilyas, Aleksander Madry | Published: 2024-04-17
Tags: Watermarking | Model Interpretability | Model editing techniques

DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models

Authors: Xinwei Wu, Junzhuo Li, Minghui Xu, Weilong Dong, Shuangzhi Wu, Chao Bian, Deyi Xiong | Published: 2023-10-31 | Updated: 2023-12-05
Tags: Privacy Protection Method | Privacy Technique | Model editing techniques

Proof of Unlearning: Definitions and Instantiation

Authors: Jiasi Weng, Shenglong Yao, Yuefeng Du, Junjie Huang, Jian Weng, Cong Wang | Published: 2022-10-20 | Updated: 2022-10-21
Tags: DNN IP Protection Method | Privacy Risk Management | Model editing techniques