Abstract
Machine learning models are known to memorize private data in order to reduce
their training loss, which can be inadvertently exploited by privacy attacks
such as model inversion and membership inference. To protect against these
attacks, differential privacy (DP) has become the de facto standard for
privacy-preserving machine learning, particularly through popular training
algorithms based on stochastic gradient descent, such as DPSGD. Nonetheless,
DPSGD still suffers from severe utility loss due to its slow convergence. This
is caused partly by the random sampling, which introduces bias and variance
into the gradient estimate, and partly by the Gaussian noise, which causes the
gradient updates to fluctuate.
Our key idea to address these issues is to apply updates selectively during
model training, discarding those that are useless or even harmful. To this
end, this paper proposes DPSUR, a Differentially Private training
framework based on Selective Updates and Release, where the gradient from each
iteration is evaluated based on a validation test, and only those updates
leading to convergence are applied to the model. As such, DPSUR keeps the
training moving in the right direction and thus converges faster than DPSGD.
The main challenges lie in two aspects: the privacy risk arising from gradient
evaluation, and the gradient selection strategy for model updates. To address
these challenges, DPSUR introduces a clipping strategy for update
randomization and a threshold mechanism for gradient selection. Experiments
conducted on MNIST, FMNIST, CIFAR-10, and IMDB datasets show that DPSUR
significantly outperforms previous works in terms of convergence speed and
model utility.
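
To make the selective update-and-release loop concrete, below is a minimal
sketch in PyTorch of how such a mechanism could look. Everything here is an
assumption for illustration: the toy model and data, the constants (CLIP,
SIGMA, DELTA_CLIP, SIGMA_V, THRESHOLD), and the helper names (noisy_gradient,
val_loss) are not taken from the paper, and the sketch omits privacy
accounting entirely.

import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy model and random data stand in for the real training setup (illustrative).
model = nn.Linear(10, 2)
loss_fn = nn.CrossEntropyLoss()
x_train, y_train = torch.randn(256, 10), torch.randint(0, 2, (256,))
x_val, y_val = torch.randn(64, 10), torch.randint(0, 2, (64,))

LR, BATCH = 0.1, 32
CLIP, SIGMA = 1.0, 1.0             # per-sample clipping bound, noise multiplier
DELTA_CLIP, SIGMA_V = 0.05, 0.02   # clip bound and noise for the validation test
THRESHOLD = 0.0                    # accept only if the noisy loss change is below this

def noisy_gradient():
    # DPSGD-style gradient: clip each per-sample gradient, then add Gaussian noise.
    idx = torch.randint(0, len(x_train), (BATCH,)).tolist()
    grads = [torch.zeros_like(p) for p in model.parameters()]
    for i in idx:
        model.zero_grad()
        loss_fn(model(x_train[i:i + 1]), y_train[i:i + 1]).backward()
        norm = torch.sqrt(sum(p.grad.pow(2).sum() for p in model.parameters()))
        scale = min(1.0, CLIP / (norm.item() + 1e-12))
        for g, p in zip(grads, model.parameters()):
            g.add_(p.grad, alpha=scale)
    return [(g + SIGMA * CLIP * torch.randn_like(g)) / BATCH for g in grads]

def val_loss():
    with torch.no_grad():
        return loss_fn(model(x_val), y_val).item()

for step in range(100):
    before = val_loss()
    grads = noisy_gradient()
    saved = [p.detach().clone() for p in model.parameters()]
    with torch.no_grad():
        for p, g in zip(model.parameters(), grads):
            p -= LR * g
    # Randomized validation test: clip the loss change and perturb it with
    # Gaussian noise so the accept/reject decision itself is privatized.
    delta = max(-DELTA_CLIP, min(DELTA_CLIP, val_loss() - before))
    if delta + SIGMA_V * torch.randn(1).item() >= THRESHOLD:
        # Reject and roll back: the noisy test deems this update unhelpful.
        with torch.no_grad():
            for p, s in zip(model.parameters(), saved):
                p.copy_(s)

The point the sketch illustrates is that the gradient evaluation itself leaks
information, which is why the validation loss change is clipped and perturbed
before being compared against the threshold, mirroring the clipping strategy
for update randomization and the threshold mechanism described above.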