AIセキュリティポータル K Program
Bounding Reconstruction Attack Success of Adversaries Without Data Priors
Share
Abstract
Reconstruction attacks on machine learning (ML) models pose a strong risk of leakage of sensitive data. In specific contexts, an adversary can (almost) perfectly reconstruct training data samples from a trained model using the model's gradients. When training ML models with differential privacy (DP), formal upper bounds on the success of such reconstruction attacks can be provided. So far, these bounds have been formulated under worst-case assumptions that might not hold high realistic practicality. In this work, we provide formal upper bounds on reconstruction success under realistic adversarial settings against ML models trained with DP and support these bounds with empirical results. With this, we show that in realistic scenarios, (a) the expected reconstruction success can be bounded appropriately in different contexts and by different metrics, which (b) allows for a more educated choice of a privacy parameter.
When the Curious Abandon Honesty: Federated Learning Is Not Private
Franziska Boenisch, Adam Dziedzic, Roei Schuster, Ali Shahin Shamsabadi, Ilia Shumailov, Nicolas Papernot
Published: 2021.12.6
Stochastic gradient descent with differentially private updates
S. Song, K. Chaudhuri, A. D. Sarwate
Published: 2013
Deep learning with differential privacy
Martin Abadi, Andy Chu, Ian Goodfellow, H Brendan McMahan, Ilya Mironov, Kunal Talwar, Li Zhang
Published: 2016
Reconstructing Training Data with Informed Adversaries
Borja Balle, Giovanni Cherubin, Jamie Hayes
Published: 2022.1.13
Bounding training data reconstruction in dp-sgd
Jamie Hayes, Saeed Mahloujifar, Borja Balle
Published: 2023
Optimal privacy guarantees for a relaxed threat model: Addressing sub-optimal adversaries in differentially private machine learning
Georgios Kaissis, Alexander Ziller, Stefan Kolek, Anneliese Riess, Daniel Rueckert
Published: 2023
Adversary instantiation: Lower bounds for differentially private machine learning
Milad Nasr, Shuang Songi, Abhradeep Thakurta, Nicolas Papernot, Nicholas Carlini
Published: 2021
Zen and the art of model adaptation: Low-utility-cost attack mitigations in collaborative machine learning
Dmitrii Usynin, Daniel Rueckert, Jonathan Passerat-Palmbach, Georgios Kaissis
Published: 2022
Imagenet: A large-scale hierarchical image database
J. Deng, W. Dong, R. Socher, L. Li, K. Li, L. Fei-Fei
Published: 2009
Extracting training data from diffusion models
Nicolas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito, Eric Wallace
Published: 2023
Elements of information theory
Thomas M Cover
Published: 1999
An overlap invariant entropy measure of 3d medical image alignment
Colin Studholme, Derek LG Hill, David J Hawkes
Published: 1999
Share