AIセキュリティポータル K Program
On the Query Complexity of Training Data Reconstruction in Private Learning
Share
Abstract
We analyze the number of queries that a whitebox adversary needs to make to a private learner in order to reconstruct its training data. For $(\epsilon, \delta)$ DP learners with training data drawn from any arbitrary compact metric space, we provide the \emph{first known lower bounds on the adversary's query complexity} as a function of the learner's privacy parameters. \emph{Our results are minimax optimal for every $\epsilon \geq 0, \delta \in [0, 1]$, covering both $\epsilon$-DP and $(0, \delta)$ DP as corollaries}. Beyond this, we obtain query complexity lower bounds for $(\alpha, \epsilon)$ R\'enyi DP learners that are valid for any $\alpha > 1, \epsilon \geq 0$. Finally, we analyze data reconstruction attacks on locally compact metric spaces via the framework of Metric DP, a generalization of DP that accounts for the underlying metric structure of the data. In this setting, we provide the first known analysis of data reconstruction in unbounded, high dimensional spaces and obtain query complexity lower bounds that are nearly tight modulo logarithmic factors.
Moment estimates derived from poincare and logarithmic sobolev inequalities
Shigeki Aida, Daniel Stroock
Published: 1994
Reconstructing Training Data with Informed Adversaries
Borja Balle, Giovanni Cherubin, Jamie Hayes
Published: 1.13.2022
Poincare’s inequalities and talagrand’s concentration phenomenon for the exponential distribution
Sergey Bobkov, Michel Ledoux
Published: 1997
Spectral gap and concentration for some spherically symmetric probability measures
Sergey G Bobkov
Published: 2003
Private measures, random walks, and synthetic data
March Boedihardjo, Thomas Strohmer, Roman Vershynin
Published: 2022
Language models are few-shot learners
T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, D. Amodei
Published: 2020
Network size and size of the weights in memorization with two-layers neural networks
Sebastien Bubeck, Ronen Eldan, Yin Tat Lee, Dan Mikulincer
Published: 2020
Concentrated differential privacy: Simplifications, extensions, and lower bounds
Mark Bun, Thomas Steinke
Published: 2016
Extracting training data from large language models
Nicholas Carlini, Florian Tramer, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom Brown, Dawn Song, Ulfar Erlingsson, Alina Oprea, Colin Raffel
Published: 2021
Broadening the scope of differential privacy using metrics
K. Chatzikokolakis, M. E. Andres, N. E. Bordenabe, C. Palamidessi
Published: 2013
Privacy-preserving logistic regression
Kamalika Chaudhuri, Claire Monteleoni
Published: 2008
Concentrated differential privacy
Cynthia Dwork, Guy N. Rothblum
Published: 2016
Scalar poincare implies matrix poincaré
Ankit Garg, Tarun Kathuria, Nikhil Srivastava
Published: 2020
Property testing and its connection to learning and approximation
Oded Goldreich, Shari Goldwasser, Dana Ron
Published: 1998
From poincare inequalities to nonlinear matrix concentration
De Huang, Joel A Tropp
Published: 2021
The composition theorem for differential privacy
Peter Kairouz, Sewoong Oh, Pramod Viswanath
Published: 2015
An introduction to computational learning theory
Michael J Kearns, Umesh Vazirani
Published: 1994
Gradient-based learning applied to document recognition
Y. Lecun, L. Bottou, Y. Bengio, P. Haffner
Published: 1998
Markov chains and mixing times
David A Levin, Yuval Peres
Published: 2017
Optimal membership inference bounds for adaptive composition of sampled gaussian mechanisms
Saeed Mahloujifar, Alexandre Sablayrolles, Graham Cormode, Somesh Jha
Published: 2022
Mechanism design via differential privacy
Frank McSherry, Kunal Talwar
Published: 2007
The complexity of computing the optimal composition of differential privacy
Jack Murtagh, Salil Vadhan
Published: 2015
Zero-shot text-to-image generation
Aditya Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, Ilya Sutskever
Published: 2021
Principles of mathematical analysis
Walter Rudin
Published: 1976
Auditing data privacy for machine learning
Reza Shokri
Published: 2022
Stochastic gradient descent with differentially private updates
S. Song, K. Chaudhuri, A. D. Sarwate
Published: 2013
Theory of games and economic behavior
John Von Neumann, Oskar Morgenstern
Published: 1947
High-dimensional statistics: A non-asymptotic viewpoint.
Wainwright, M.J.
Published: 2019
Randomized response: a survey technique for eliminating evasive answer bias
Warner, S. L.
Published: 1965
Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting
Samuel Yeom, Irene Giacomelli, Matt Fredrikson, Somesh Jha
Published: 9.6.2017
Share