Reconstructing Training Data from Trained Neural Networks

arxiv

AI Security Portal bot

The information in this literature database is collected automatically.

Source

https://arxiv.org/abs/2206.07758

PDF

https://arxiv.org/pdf/2206.07758

Bibliographic Information

Authors: Niv Haim; Gal Vardi; Gilad Yehudai; Ohad Shamir; Michal Irani
Published: 2022-06-16
Updated: 2022-12-05
Affiliation: Weizmann Institute of Science
Country of affiliation: Israel
Conference: Conference on Neural Information Processing Systems (NeurIPS)

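Labels Estimated by AI

Hyperparameter tuning

Performance evaluation metrics

Adversarial learning

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".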

Abstract

Understanding to what extent neural networks memorize training data is an intriguing question with practical and theoretical implications. In this paper we show that in some cases a significant fraction of the training data can in fact be reconstructed from the parameters of a trained neural network classifier. We propose a novel reconstruction scheme that stems from recent theoretical results about the implicit bias in training neural networks with gradient-based methods. To the best of our knowledge, our results are the first to show that reconstructing a large portion of the actual training samples from a trained neural network classifier is generally possible. This has negative implications on privacy, as it can be used as an attack for revealing sensitive training data. We demonstrate our method for binary MLP classifiers on a few standard computer vision datasets.
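
The abstract does not spell out the reconstruction scheme, so the following is only a rough sketch of the idea it points to. Implicit-bias results for homogeneous networks trained with gradient-based methods suggest that the trained parameters approximately satisfy a stationarity condition of the form theta ≈ sum_i lambda_i * y_i * grad_theta Phi(theta; x_i), where the x_i are training samples with labels y_i in {-1, +1} and the lambda_i are non-negative coefficients. A reconstruction attack can then treat the x_i and lambda_i as unknowns and search for values that make the residual of this condition small. The bias-free two-layer ReLU network, the function names (`forward`, `param_grad`, `stationarity_residual`), and the plain squared-norm objective are all assumptions made for illustration; the objective used in the paper may include additional terms.

```python
# Illustrative sketch only: network shape, names, and objective are assumptions.

def forward(W, v, x):
    """Bias-free two-layer ReLU network: Phi(theta; x) = v . relu(W x)."""
    pre = [sum(w * xi for w, xi in zip(row, x)) for row in W]
    return sum(vj * max(p, 0.0) for vj, p in zip(v, pre))


def flatten(W, v):
    """Stack all parameters into one flat vector theta (W row-major, then v)."""
    return [w for row in W for w in row] + list(v)


def param_grad(W, v, x):
    """Gradient of Phi with respect to theta, in the same order as flatten()."""
    pre = [sum(w * xi for w, xi in zip(row, x)) for row in W]
    # d Phi / d W[j][k] = v[j] * 1[pre[j] > 0] * x[k]
    grad_W = [vj * xi if p > 0 else 0.0 for vj, p in zip(v, pre) for xi in x]
    # d Phi / d v[j] = relu(pre[j])
    grad_v = [max(p, 0.0) for p in pre]
    return grad_W + grad_v


def stationarity_residual(W, v, xs, ys, lams):
    """Squared norm of theta - sum_i lam_i * y_i * grad_theta Phi(theta; x_i).

    A reconstruction attack would minimize this over the candidate samples xs
    and coefficients lams while keeping the trained parameters (W, v) fixed.
    """
    theta = flatten(W, v)
    combo = [0.0] * len(theta)
    for x, y, lam in zip(xs, ys, lams):
        g = param_grad(W, v, x)
        combo = [c + lam * y * gi for c, gi in zip(combo, g)]
    return sum((t - c) ** 2 for t, c in zip(theta, combo))
```

In practice the candidate samples and coefficients would be updated with a gradient-based optimizer in an automatic-differentiation framework; the functions above only evaluate the quantity such an optimizer would drive toward zero.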

External Datasets

CIFAR10

MNIST