Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2211.06530

PDF

https://arxiv.org/pdf/2211.06530

Paper Information

Author: Christopher A. Choquette-Choo;H. Brendan McMahan;Keith Rush;Abhradeep Thakurta
Published: 2022-11-12
Updated: 2023-06-09
Affiliation: Google Research
Country: United States of America
Conference: International Conference on Machine Learning (ICML)

Labels Estimated by AI

Privacy Protection Method; Optimization Methods

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

We introduce new differentially private (DP) mechanisms for gradient-based machine learning (ML) with multiple passes (epochs) over a dataset, substantially improving the achievable privacy-utility-computation tradeoffs. We formalize the problem of DP mechanisms for adaptive streams with multiple participations and introduce a non-trivial extension of online matrix factorization DP mechanisms to our setting. This includes establishing the necessary theory for sensitivity calculations and efficient computation of optimal matrices. For some applications, such as those with $>10{,}000$ SGD steps, applying these optimal techniques becomes computationally expensive. We thus design an efficient Fourier-transform-based mechanism with only a minor utility loss. Extensive empirical evaluation on both example-level DP for image classification and user-level DP for language modeling demonstrates substantial improvements over all previous methods, including the widely used DP-SGD. Though our primary application is to ML, our main DP results are applicable to arbitrary linear queries and hence may have much broader applicability.
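The abstract refers to matrix factorization mechanisms and to sensitivity under multiple participations without spelling either out. The following is a minimal sketch of the general idea, not the paper's implementation: a linear workload A (here prefix sums, as in accumulating gradients) is factored as A = BC, Gaussian noise z scaled to the sensitivity of C is added, and B(Cx + z) = Ax + Bz is released. The function names, the scalar-valued inputs, and the participation rule (a user contributes at most k times, at steps exactly b apart, with the sensitivity taken as the largest L2 norm of the summed columns of C, which is assumed valid only when C has non-negative entries) are illustrative assumptions. The two trivial factorizations used in the example are chosen for simplicity and are not the optimized matrices the paper computes.

```python
import math
import random


def prefix_sum_workload(n):
    """Lower-triangular all-ones matrix A: (A x)[t] is the sum of x[0..t]."""
    return [[1.0 if j <= i else 0.0 for j in range(n)] for i in range(n)]


def identity(n):
    """n x n identity matrix as a list of rows."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def matvec(m, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def multi_participation_sensitivity(c, k, b):
    """Assumed rule: max over start steps of the L2 norm of the sum of the
    columns of C at steps start, start + b, ... (at most k of them).
    This shortcut is only assumed valid when C has non-negative entries."""
    n = len(c[0])
    worst = 0.0
    for start in range(n):
        steps = [start + j * b for j in range(k) if start + j * b < n]
        summed = [sum(row[s] for s in steps) for row in c]
        worst = max(worst, math.sqrt(sum(v * v for v in summed)))
    return worst


def matrix_mechanism(b_mat, c_mat, x, sigma, k, b, rng):
    """Release B (C x + z), which equals A x + B z when A = B C."""
    sens = multi_participation_sensitivity(c_mat, k, b)
    # Noise standard deviation grows with the sensitivity of C.
    z = [rng.gauss(0.0, sigma * sens) for _ in range(len(c_mat))]
    cx = matvec(c_mat, x)
    return matvec(b_mat, [u + v for u, v in zip(cx, z)])


if __name__ == "__main__":
    n = 4
    a = prefix_sum_workload(n)
    x = [1.0, 2.0, 3.0, 4.0]
    rng = random.Random(0)
    # Factorization B = A, C = I: independent noise on each input.
    print(matrix_mechanism(a, identity(n), x, sigma=1.0, k=2, b=2, rng=rng))
    # Factorization B = I, C = A: noise added directly to the prefix sums.
    print(matrix_mechanism(identity(n), a, x, sigma=1.0, k=2, b=2, rng=rng))
```

The two factorizations shown are the extremes: with C = I the sensitivity is small but the noise accumulates through B = A, while with C = A the noise is added once per output but the sensitivity is larger. Factorizations between these extremes are what an optimized mechanism would search over.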

External Datasets

CIFAR10