Bounding Reconstruction Attack Success of Adversaries Without Data Priors | AIセキュリティポータル

EN

JA

EN

TOP 文献データベース Bounding Reconstruction Attack Success of Adversaries Without Data Priors

arxiv

Bounding Reconstruction Attack Success of Adversaries Without Data Priors

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2402.12861

PDF

https://arxiv.org/pdf/2402.12861

文献情報

作者: Alexander Ziller;Anneliese Riess;Kristian Schwethelm;Tamara T. Mueller;Daniel Rueckert;Georgios Kaissis
公開日: 2024-2-20
所属機関: Chair for Artificial Intelligence in Medicine, Technical University of Munich
所属の国: Germany
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

透かし評価データプライバシー評価プライバシー保護手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Reconstruction attacks on machine learning (ML) models pose a strong risk of leakage of sensitive data. In specific contexts, an adversary can (almost) perfectly reconstruct training data samples from a trained model using the model's gradients. When training ML models with differential privacy (DP), formal upper bounds on the success of such reconstruction attacks can be provided. So far, these bounds have been formulated under worst-case assumptions that might not hold high realistic practicality. In this work, we provide formal upper bounds on reconstruction success under realistic adversarial settings against ML models trained with DP and support these bounds with empirical results. With this, we show that in realistic scenarios, (a) the expected reconstruction success can be bounded appropriately in different contexts and by different metrics, which (b) allows for a more educated choice of a privacy parameter.

外部データセット

ImageNet

参考文献

Robbing the fed: Directly obtaining private data in federated learning with modified models

Liam Fowl, Jonas Geiping, Wojtek Czaja, Micah Goldblum, Tom Goldstein

Published: 2021

European Symposium on Security and Privacy (EuroS&P)

When the Curious Abandon Honesty: Federated Learning Is Not Private

Franziska Boenisch, Adam Dziedzic, Roei Schuster, Ali Shahin Shamsabadi, Ilia Shumailov, Nicolas Papernot

Published: 2021.12.6

In federated learning (FL), data does not leave personal devices when they are jointly training a machine learning model. Instead, these devices share gradients, parameters, or other model updates, with a central party (e.g., a company) coordinating the training. Because data never "leaves" personal devices, FL is often presented as privacy-preserving. Yet, recently it was shown that this protection is but a thin facade, as even a passive, honest-but-curious attacker observing gradients can reconstruct data of individual users contributing to the protocol. In this work, we show a novel data reconstruction attack which allows an active and dishonest central party to efficiently extract user data from the received gradients. While prior work on data reconstruction in FL relies on solving computationally expensive optimization problems or on making easily detectable modifications to the shared model's architecture or parameters, in our attack the central party makes inconspicuous changes to the shared model's weights before sending them out to the users. We call the modified weights of our attack trap weights. Our active attacker is able to recover user data perfectly, i.e., with zero error, even when this data stems from the same class. Recovery comes with near-zero costs: the attack requires no complex optimization objectives. Instead, our attacker exploits inherent data leakage from model gradients and simply amplifies this effect by maliciously altering the weights of the shared model through the trap weights. These specificities enable our attack to scale to fully-connected and convolutional deep neural networks trained with large mini-batches of data. For example, for the high-dimensional vision dataset ImageNet, we perfectly reconstruct more than 50% of the training data points from mini-batches as large as 100 data points.

データ抽出と分析ポイズニングトレーニングデータ抽出手法

IEEE Global Conference on Signal and Information Processing

Stochastic gradient descent with differentially private updates

S. Song, K. Chaudhuri, A. D. Sarwate

Published: 2013

Proceedings of the 2016 ACM SIGSAC conference on computer and communications security

Deep learning with differential privacy

Martin Abadi, Andy Chu, Ian Goodfellow, H Brendan McMahan, Ilya Mironov, Kunal Talwar, Li Zhang

Published: 2016

Proceedings of the 39th International Conference on Machine Learning

Bounding training data reconstruction in private (deep) learning

Chuan Guo, Brian Karrer, Kamalika Chaudhuri, Laurens van der Maaten

Published: 2022

被引用数 13

Reconstructing Training Data with Informed Adversaries

Borja Balle, Giovanni Cherubin, Jamie Hayes

Published: 2022.1.13

Given access to a machine learning model, can an adversary reconstruct the model's training data? This work studies this question from the lens of a powerful informed adversary who knows all the training data points except one. By instantiating concrete attacks, we show it is feasible to reconstruct the remaining data point in this stringent threat model. For convex models (e.g. logistic regression), reconstruction attacks are simple and can be derived in closed-form. For more general models (e.g. neural networks), we propose an attack strategy based on training a reconstructor network that receives as input the weights of the model under attack and produces as output the target data point. We demonstrate the effectiveness of our attack on image classifiers trained on MNIST and CIFAR-10, and systematically investigate which factors of standard machine learning pipelines affect reconstruction success. Finally, we theoretically investigate what amount of differential privacy suffices to mitigate reconstruction attacks by informed adversaries. Our work provides an effective reconstruction attack that model developers can use to assess memorization of individual points in general settings beyond those considered in previous works (e.g. generative language models or access to training gradients); it shows that standard models have the capacity to store enough information to enable high-fidelity reconstruction of training data points; and it demonstrates that differential privacy can successfully mitigate such attacks in a parameter regime where utility degradation is minimal.

再構成攻撃ポイズニングデータ選択戦略

Thirty-seventh Conference on Neural Information Processing Systems

Bounding training data reconstruction in dp-sgd

Jamie Hayes, Saeed Mahloujifar, Borja Balle

Published: 2023

Computing Research Repository (CoRR)

Bounding data reconstruction attacks with the hypothesis testing interpretation of differential privacy

Georgios Kaissis, Jamie Hayes, Alexander Ziller, Daniel Rueckert

Published: 2023.7.8

We explore Reconstruction Robustness (ReRo), which was recently proposed as an upper bound on the success of data reconstruction attacks against machine learning models. Previous research has demonstrated that differential privacy (DP) mechanisms also provide ReRo, but so far, only asymptotic Monte Carlo estimates of a tight ReRo bound have been shown. Directly computable ReRo bounds for general DP mechanisms are thus desirable. In this work, we establish a connection between hypothesis testing DP and ReRo and derive closed-form, analytic or numerical ReRo bounds for the Laplace and Gaussian mechanisms and their subsampled variants.

アルゴリズム設計データの隠蔽セキュリティ保証

Thirty-seventh Conference on Neural Information Processing Systems

Optimal privacy guarantees for a relaxed threat model: Addressing sub-optimal adversaries in differentially private machine learning

Georgios Kaissis, Alexander Ziller, Stefan Kolek, Anneliese Riess, Daniel Rueckert

Published: 2023

2021 IEEE Symposium on security and privacy (SP)

Adversary instantiation: Lower bounds for differentially private machine learning

Milad Nasr, Shuang Songi, Abhradeep Thakurta, Nicolas Papernot, Nicholas Carlini

Published: 2021

The American Statistician

Thirteen ways to look at the correlation coefficient

Joseph Lee Rodgers, W. Alan Nicewander

Published: 1988

Proc. Priv. Enhancing Technol.

Zen and the art of model adaptation: Low-utility-cost attack mitigations in collaborative machine learning

Dmitrii Usynin, Daniel Rueckert, Jonathan Passerat-Palmbach, Georgios Kaissis

Published: 2022

Journal of the Royal Statistical Society Series B: Statistical Methodology

Gaussian differential privacy

Jinshuo Dong, Aaron Roth, Weijie J Su

Published: 2022

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition CVPR

Imagenet: A large-scale hierarchical image database

J. Deng, W. Dong, R. Socher, L. Li, K. Li, L. Fei-Fei

Published: 2009

Advances in Neural Information Processing Systems

Denoising diffusion probabilistic models

Jonathan Ho, Ajay Jain, Pieter Abbeel

Published: 2020

International Conference on Learning Representations

Denoising diffusion implicit models

Jiaming Song, Chenlin Meng, Stefano Ermon

Published: 2021

Conference on Neural Information Processing Systems (NeurIPS)

Reconstructing Training Data from Trained Neural Networks

Niv Haim, Gal Vardi, Gilad Yehudai, Ohad Shamir, Michal Irani

Published: 2022.6.16

Understanding to what extent neural networks memorize training data is an intriguing question with practical and theoretical implications. In this paper we show that in some cases a significant fraction of the training data can in fact be reconstructed from the parameters of a trained neural network classifier. We propose a novel reconstruction scheme that stems from recent theoretical results about the implicit bias in training neural networks with gradient-based methods. To the best of our knowledge, our results are the first to show that reconstructing a large portion of the actual training samples from a trained neural network classifier is generally possible. This has negative implications on privacy, as it can be used as an attack for revealing sensitive training data. We demonstrate our method for binary MLP classifiers on a few standard computer vision datasets.

ハイパーパラメータ調整性能評価指標敵対的学習

32nd USENIX Security Symposium (USENIX Security 23)

Extracting training data from diffusion models

Nicolas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito, Eric Wallace

Published: 2023

John Wiley & Sons

Elements of information theory

Thomas M Cover

Published: 1999

Pattern recognition

An overlap invariant entropy measure of 3d medical image alignment

Colin Studholme, Derek LG Hill, David J Hawkes

Published: 1999