DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction | AI Security Portal

JA

JA

EN

TOP Literature Database DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction

arxiv

DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2410.03883

PDF

https://arxiv.org/pdf/2410.03883

Paper Information

Author: Xinwei Zhang,Zhiqi Bu,Borja Balle,Mingyi Hong,Meisam Razaviyayn,Vahab Mirrokni
Published: 10-5-2024
Updated: 4-29-2025
Affiliation: University of Southern California
Country: United States of America
Conference

Labels Estimated by AI

Framework Optimization Problem

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Differential privacy (DP) offers a robust framework for safeguarding individual data privacy. To utilize DP in training modern machine learning models, differentially private optimizers have been widely used in recent years. A popular approach to privatize an optimizer is to clip the individual gradients and add sufficiently large noise to the clipped gradient. This approach led to the development of DP optimizers that have comparable performance with their non-private counterparts in fine-tuning tasks or in tasks with a small number of training parameters. However, a significant performance drop is observed when these optimizers are applied to large-scale training. This degradation stems from the substantial noise injection required to maintain DP, which disrupts the optimizer's dynamics. This paper introduces DiSK, a novel framework designed to significantly enhance the performance of DP optimizers. DiSK employs Kalman filtering, a technique drawn from control and signal processing, to effectively denoise privatized gradients and generate progressively refined gradient estimations. To ensure practicality for large-scale training, we simplify the Kalman filtering process, minimizing its memory and computational demands. We establish theoretical privacy-utility trade-off guarantees for DiSK, and demonstrate provable improvements over standard DP optimizers like DPSGD in terms of iteration complexity upper-bound. Extensive experiments across diverse tasks, including vision tasks such as CIFAR-100 and ImageNet-1k and language fine-tuning tasks such as GLUE, E2E, and DART, validate the effectiveness of DiSK. The results showcase its ability to significantly improve the performance of DP optimizers, surpassing state-of-the-art results under the same privacy constraints on several benchmarks.

External Datasets

MNIST

CIFAR-10

CIFAR-100

ImageNet-1k

GLUE

E2E

DART

References

Proceedings of the 2016 ACM SIGSAC conference on computer and communications security

Deep learning with differential privacy

Martin Abadi, Andy Chu, Ian Goodfellow, H Brendan McMahan, Ilya Mironov, Kunal Talwar, Li Zhang

Published: 2016

International conference on machine learning

Variance reduction for faster non-convex optimization

Zeyuan Allen-Zhu, Elad Hazan

Published: 2016

Advances in Neural Information Processing Systems

Differentially private learning with adaptive clipping

Galen Andrew, Om Thakkar, Brendan McMahan, Swaroop Ramaswamy

Published: 2021

2019 American control conference (ACC)

Analysis of the heavy-ball algorithm using integral quadratic constraints

Apurva Badithela, Peter Seiler

Published: 2019

International Conference on Machine Learning

Improving the gaussian mechanism for differential privacy: Analytical calibration and optimal denoising

Borja Balle, Yu-Xiang Wang

Published: 2018

Advances in Neural Information Processing Systems

Dp-mix: mixup-based data augmentation for differentially private learning

Wenxuan Bao, Francesco Pittaluga, Vijay Kumar BG, Vincent Bindschaedler

Published: 2024

2004 IEEE International Conference on Acoustics, Speech, and Signal Processing

Kalman filtering in stochastic gradient algorithms: construction of a stopping rule

Barbara Bittner, Luc Pronzato

Published: 2004

International Conference on Machine Learning

Differentially private optimization on large model at small cost

Z. Bu, Y. Wang, S. Zha, G. Karypis

Published: 2023

Advances in Neural Information Processing Systems

Automatic clipping: Differentially private deep learning made easier and stronger

Zhiqi Bu, Yu-Xiang Wang, Sheng Zha, George Karypis

Published: 2024

International Conference on Machine Learning

Improve single-point zeroth-order optimization using high-pass and low-pass filters

Xin Chen, Yujie Tang, Na Li

Published: 2022

The Twelfth International Conference on Learning Representations

Correlated noise provably beats independent noise for differentially private learning

Christopher A Choquette-Choo, Krishnamurthy Dj Dvijotham, Krishna Pillutla, Arun Ganesh, Thomas Steinke, Abhradeep Guha Thakurta

Published: 2024