Abstract
Differential Privacy (DP) is a mathematical framework that is increasingly
deployed to mitigate privacy risks associated with machine learning and
statistical analyses. Despite the growing adoption of DP, its technical privacy
parameters do not lend themselves to an intelligible description of the
real-world privacy risks of a given deployment: the guarantee that most
naturally follows from the DP definition is protection against membership
inference by an adversary who knows all but one data record and has unlimited
auxiliary knowledge. In many settings, this adversary is far too strong a
threat model to meaningfully inform how real-world privacy parameters should be
set.
One approach to contextualizing privacy parameters is to define and measure
the success of technical attacks, but doing so requires a systematic
categorization of the relevant attack space. In this work, we offer a detailed
taxonomy of attacks, showing the various dimensions of attacks and highlighting
that many real-world settings have been understudied. Our taxonomy provides a
roadmap for analyzing real-world deployments and developing theoretical bounds
for more informative privacy attacks. We operationalize our taxonomy by using
it to analyze a real-world case study, the Israeli Ministry of Health's recent
release of a birth dataset using DP, showing how the taxonomy enables
fine-grained threat modeling and provides insight towards making informed
privacy parameter choices. Finally, we leverage the taxonomy to define a more
realistic attack than those previously considered in the literature, namely a
distributional reconstruction attack: we generalize Balle et al.'s notion of
reconstruction robustness to a less-informed adversary with distributional
uncertainty, and extend the worst-case guarantees of DP to this average-case
setting.
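For reference, the formal guarantee the abstract appeals to is the standard (ε, δ)-differential-privacy definition; this is textbook background rather than a contribution of the paper. A randomized mechanism M is (ε, δ)-DP if for all neighboring datasets D, D′ differing in a single record and all measurable output sets S:

\[
\Pr[M(D) \in S] \;\le\; e^{\varepsilon}\,\Pr[M(D') \in S] + \delta .
\]

Read as a membership-inference bound, this says that an adversary who already knows every record except one, and who observes the mechanism's output, cannot distinguish whether the remaining record was present (D) or absent (D′) much better than chance; this is precisely the "all but one record, unlimited auxiliary knowledge" adversary the abstract argues is often too strong a baseline for setting real-world parameters.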