Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation

TOP Literature Database Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2302.01428

PDF

https://arxiv.org/pdf/2302.01428

Paper Information

Author: Noel Loo;Ramin Hasani;Mathias Lechner;Alexander Amini;Daniela Rus
Published: 2-3-2023
Updated: 11-10-2023
Affiliation: MIT CSAIL
Country: United States of America
Conference: International Conference on Learning Representations (ICLR)

Labels Estimated by AI

Reconstruction Durability Privacy Enhancing Technology Model Performance Evaluation

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Modern deep learning requires large volumes of data, which could contain sensitive or private information that cannot be leaked. Recent work has shown for homogeneous neural networks a large portion of this training data could be reconstructed with only access to the trained network parameters. While the attack was shown to work empirically, there exists little formal understanding of its effective regime which datapoints are susceptible to reconstruction. In this work, we first build a stronger version of the dataset reconstruction attack and show how it can provably recover the \emph{entire training set} in the infinite width regime. We then empirically study the characteristics of this attack on two-layer networks and reveal that its success heavily depends on deviations from the frozen infinite-width Neural Tangent Kernel limit. Next, we study the nature of easily-reconstructed images. We show that both theoretically and empirically, reconstructed images tend to "outliers" in the dataset, and that these reconstruction attacks can be used for \textit{dataset distillation}, that is, we can retrain on reconstructed images and obtain high predictive accuracy.

External Datasets

MNIST

CIFAR-10