AIセキュリティポータル K Program
Why Does Differential Privacy with Large Epsilon Defend Against Practical Membership Inference Attacks?
Share
Abstract
For small privacy parameter $\epsilon$, $\epsilon$-differential privacy (DP) provides a strong worst-case guarantee that no membership inference attack (MIA) can succeed at determining whether a person's data was used to train a machine learning model. The guarantee of DP is worst-case because: a) it holds even if the attacker already knows the records of all but one person in the data set; and b) it holds uniformly over all data sets. In practical applications, such a worst-case guarantee may be overkill: practical attackers may lack exact knowledge of (nearly all of) the private data, and our data set might be easier to defend, in some sense, than the worst-case data set. Such considerations have motivated the industrial deployment of DP models with large privacy parameter (e.g. $\epsilon \geq 7$), and it has been observed empirically that DP with large $\epsilon$ can successfully defend against state-of-the-art MIAs. Existing DP theory cannot explain these empirical findings: e.g., the theoretical privacy guarantees of $\epsilon \geq 7$ are essentially vacuous. In this paper, we aim to close this gap between theory and practice and understand why a large DP parameter can prevent practical MIAs. To tackle this problem, we propose a new privacy notion called practical membership privacy (PMP). PMP models a practical attacker's uncertainty about the contents of the private data. The PMP parameter has a natural interpretation in terms of the success rate of a practical MIA on a given data set. We quantitatively analyze the PMP parameter of two fundamental DP mechanisms: the exponential mechanism and Gaussian mechanism. Our analysis reveals that a large DP parameter often translates into a much smaller PMP parameter, which guarantees strong privacy against practical MIAs. Using our findings, we offer principled guidance for practitioners in choosing the DP parameter.
Differential Privacy Overview
Apple
Published: 2016
Improving the gaussian mechanism for differential privacy: Analytical calibration and optimal denoising
Borja Balle, Yu-Xiang Wang
Published: 2018
Coupled-worlds privacy: Exploiting adversarial uncertainty in statistical data privacy
Bassily, R., Groce, A., Katz, J., Smith, A.
Published: 2013
Noiseless database privacy
Bhaskar, R., Bhowmick, A., Goyal, V., Laxman, S., Thakurta, A.
Published: 2011
Membership inference attacks from first principles
Carlini, N., Chien, S., Nasr, M., Song, S., Terzis, A., Tramer, F.
Published: 2022
Extracting Training Data from Large Language Models
Carlini, N., Tramer, F., Wallace, E., Jagielski, M., Herbert-Voss, A., Lee, K., Roberts, A., Brown, T. B., Song, D., Erlingsson, U.
Published: 2021
Collecting telemetry data privately
Ding, B., Kulkarni, J., Yekhanin, S.
Published: 2017
Calibrating noise to sensitivity in private data analysis
Cynthia Dwork, Frank McSherry, Kobbi Nissim, Adam Smith
Published: 2006
Robust traceability from trace amounts
Dwork, C., Smith, A., Steinke, T., Ullman, J., Vadhan, S.
Published: 2015
Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays
N. Homer, S. Szelinger, M. Redman, D. Duggan, W. Tembe, J. Muehling, J. V. Pearson, D. A. Stephan, S. F. Nelson, D. W. Craig
Published: 2008
What can we learn privately?
Kasiviswanathan, S. P., Lee, H. K., Nissim, K., Raskhodnikova, S., Smith, A.
Published: 2011
A rigorous and customizable framework for privacy
Kifer, D., Machanavajjhala, A.
Published: 2012
Gaussian Membership Inference Privacy
Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci
Published: 2023.6.13
On sampling, anonymization, and differential privacy or, k-anonymization meets differential privacy
Li, N., Qardaji, W., Su, D.
Published: 2012
Membership privacy: A unifying framework for privacy definitions
Li, N., Qardaji, W., Su, D., Wu, Y., Yang, W.
Published: 2013
Towards Measuring Membership Privacy
Yunhui Long, Vincent Bindschaedler, Carl A. Gunter
Published: 2017.12.26
Optimal membership inference bounds for adaptive composition of sampled gaussian mechanisms
Saeed Mahloujifar, Alexandre Sablayrolles, Graham Cormode, Somesh Jha
Published: 2022
Mechanism design via differential privacy
McSherry, F., Talwar, K.
Published: 2007
Computational differential privacy
Mironov, I., Pandey, O., Reingold, O., Vadhan, S.
Published: 2009
Privacy-preserving deep learning
Shokri, R., Shmatikov, V.
Published: 2015
RAP- POR: Randomized Aggregatable Privacy-Preserving Ordinal Response
Ulfar Erlingsson, Pihur, V., Korolova, A.
Published: 2014
Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting
Samuel Yeom, Irene Giacomelli, Matt Fredrikson, Somesh Jha
Published: 2017.9.6
Share