Have it your way: Individualized Privacy Assignment for DP-SGD

Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security

Deep learning with differential privacy

Martin Abadi, Andy Chu, Ian Goodfellow, H Brendan McMahan, Ilya Mironov, Kunal Talwar, Li Zhang

Published: 2016

arXiv preprint

Heterogeneous differential privacy

Mohammad Alaggan, Sébastien Gambs, Anne-Marie Kermarrec

Published: 2015

International Conference on Artificial Intelligence and Statistics

Hypothesis testing interpretations and Rényi differential privacy

Borja Balle, Gilles Barthe, Marco Gaboardi, Justin Hsu, Tetsuya Sato

Published: 2020

Theory of Cryptography Conference

Bounds on the sample complexity for private learning and private data release

Amos Beimel, Shiva Prasad Kasiviswanathan, Kobbi Nissim

Published: 2010

Communications of the ACM

Privacy in e-commerce: Stated preferences vs. actual behavior

Bettina Berendt, Oliver Günther, Sarah Spiekermann

Published: 2005

Individualized PATE: Differentially Private Machine Learning with Individual Privacy Guarantees

Franziska Boenisch, Christopher Mühl, Roy Rinberg, Jannis Ihrig, Adam Dziedzic

Published: 2022

Empirical Methods in Natural Language Processing

A large annotated corpus for learning natural language inference

Samuel R. Bowman, Gabor Angeli, Christopher Potts, Christopher D. Manning

Published: 2015

2022 IEEE Symposium on Security and Privacy (SP)

Membership inference attacks from first principles

Nicholas Carlini, Steve Chien, Milad Nasr, Shuang Song, Andreas Terzis, Florian Tramèr

Published: 2022

Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security

An investigation into user expectations for differential privacy

Rachel Cummings, Gabriel Kaptchuk, Elissa M Redmiles

Published: 2021

Theory and Applications of Models of Computation: 5th International Conference, TAMC 2008

Differential privacy: A survey of results

Cynthia Dwork

Published: 2008

Thirty-Fifth Conference on Neural Information Processing Systems

Individual privacy accounting via a Rényi filter

Vitaly Feldman, Tijana Zrnic

Published: 2021

arXiv preprint

Communicating privacy guarantees of differential privacy with risk communication formats

Daniel Franzen, Saskia Nuñez von Voigt, Peter Sörries, Florian Tschorsch, Claudia Müller-Birn

Published: 2022

Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security

Model inversion attacks that exploit confidence information and basic countermeasures

Matt Fredrikson, Somesh Jha, Thomas Ristenpart

Published: 2015

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

Property testing for differential privacy

Anna C Gilbert, Audra McMillan

Published: 2018

arxiv

Conference on Neural Information Processing Systems (NeurIPS)

Reconstructing Training Data from Trained Neural Networks

Niv Haim, Gal Vardi, Gilad Yehudai, Ohad Shamir, Michal Irani

Published: 6.16.2022

Understanding to what extent neural networks memorize training data is an intriguing question with practical and theoretical implications. In this paper we show that in some cases a significant fraction of the training data can in fact be reconstructed from the parameters of a trained neural network classifier. We propose a novel reconstruction scheme that stems from recent theoretical results about the implicit bias in training neural networks with gradient-based methods. To the best of our knowledge, our results are the first to show that reconstructing a large portion of the actual training samples from a trained neural network classifier is generally possible. This has negative implications on privacy, as it can be used as an attack for revealing sensitive training data. We demonstrate our method for binary MLP classifiers on a few standard computer vision datasets.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Deep residual learning for image recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun

Published: 2016

International Journal of Human-Computer Studies

Privacy practices of internet users: Self-reports versus observed behavior

Carlos Jensen, Colin Potts, Christian Jensen

Published: 2005

Advances in Neural Information Processing Systems

Differentially private bagging: Improved utility and cheaper privacy than subsample-and-aggregate

James Jordon, Jinsung Yoon, Mihaela van der Schaar

Published: 2019

2015 IEEE 31st International Conference on Data Engineering

Conservative or liberal? personalized differential privacy

Zach Jorgensen, Ting Yu, Graham Cormode

Published: 2015

International Conference on Machine Learning

Practical and private (deep) learning without sampling or shuffling

Peter Kairouz, Brendan McMahan, Shuang Song, Om Thakkar, Abhradeep Thakurta, Zheng Xu

Published: 2021

Proceedings of NAACL-HLT

BERT: Pre-training of deep bidirectional transformers for language understanding

Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova

Published: 2019

Learning multiple layers of features from tiny images

Alex Krizhevsky, Geoffrey Hinton

Published: 2009

MNIST handwritten digit database

Yann LeCun, Corinna Cortes, Christopher J. C. Burges

Published: 2010

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Partitioning-based mechanisms under personalized differential privacy

Haoran Li, Li Xiong, Zhanglong Ji, Xiaoqian Jiang

Published: 2017

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

Learning word vectors for sentiment analysis

Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, Christopher Potts

Published: 2011

arxiv

Communication-Efficient Learning of Deep Networks from Decentralized Data

H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas

Published: 2.18.2016

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image models can automatically select good photos. However, this rich data is often privacy sensitive, large in quantity, or both, which may preclude logging to the data center and training there using conventional approaches. We advocate an alternative that leaves the training data distributed on the mobile devices, and learns a shared model by aggregating locally-computed updates. We term this decentralized approach Federated Learning. We present a practical method for the federated learning of deep networks based on iterative model averaging, and conduct an extensive empirical evaluation, considering five different model architectures and four datasets. These experiments demonstrate the approach is robust to the unbalanced and non-IID data distributions that are a defining characteristic of this setting. Communication costs are the principal constraint, and we show a reduction in required communication rounds by 10-100x as compared to synchronized stochastic gradient descent.

2017 IEEE 30th computer security foundations symposium (CSF)

Rényi differential privacy

Ilya Mironov

Published: 2017

arxiv

Rényi Differential Privacy of the Sampled Gaussian Mechanism

Ilya Mironov, Kunal Talwar, Li Zhang

Published: 8.28.2019

The Sampled Gaussian Mechanism (SGM), a composition of subsampling and additive Gaussian noise, has been successfully used in a number of machine learning applications. The mechanism's unexpected power is derived from privacy amplification by sampling, where the privacy cost of a single evaluation diminishes quadratically, rather than linearly, with the sampling rate. Characterizing the precise privacy properties of SGM motivated development of several relaxations of the notion of differential privacy. This work unifies and fills in gaps in published results on SGM. We describe a numerically stable procedure for precise computation of SGM's Rényi Differential Privacy and prove a nearly tight (within a small constant factor) closed-form bound.

Reading digits in natural images with unsupervised feature learning

Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, Andrew Y Ng

Published: 2011

2020 IEEE Wireless Communications and Networking Conference (WCNC)

Utility-aware exponential mechanism for personalized differential privacy

Ben Niu, Yahong Chen, Boyang Wang, Jin Cao, Fenghua Li

Published: 2020

IEEE INFOCOM 2021-IEEE Conference on Computer Communications

AdaPDP: Adaptive personalized differential privacy

Ben Niu, Yahong Chen, Boyang Wang, Zhibo Wang, Fenghua Li, Jin Cao

Published: 2021

Opacus get_noise_multiplier-function

arxiv

International Conference on Learning Representations (ICLR)

Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

Nicolas Papernot, Martín Abadi, Úlfar Erlingsson, Ian Goodfellow, Kunal Talwar

Published: 10.19.2016

Some machine learning applications involve training data that is sensitive, such as the medical histories of patients in a clinical trial. A model may inadvertently and implicitly store some of its training data; careful analysis of the model may therefore reveal sensitive information. To address this problem, we demonstrate a generally applicable approach to providing strong privacy guarantees for training data: Private Aggregation of Teacher Ensembles (PATE). The approach combines, in a black-box fashion, multiple models trained with disjoint datasets, such as records from different subsets of users. Because they rely directly on sensitive data, these models are not published, but instead used as "teachers" for a "student" model. The student learns to predict an output chosen by noisy voting among all of the teachers, and cannot directly access an individual teacher or the underlying data or parameters. The student's privacy properties can be understood both intuitively (since no single teacher and thus no single dataset dictates the student's training) and formally, in terms of differential privacy. These properties hold even if an adversary can not only query the student but also inspect its internal workings. Compared with previous work, the approach imposes only weak assumptions on how teachers are trained: it applies to any model, including non-convex models like DNNs. We achieve state-of-the-art privacy/utility trade-offs on MNIST and SVHN thanks to an improved privacy analysis and semi-supervised learning.

International Conference on Learning Representations

Scalable private learning with PATE

Nicolas Papernot, Shuang Song, Ilya Mironov, Ananth Raghunathan, Kunal Talwar, Úlfar Erlingsson

Published: 2018

arxiv

Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning

Ahmed Salem, Apratim Bhattacharya, Michael Backes, Mario Fritz, Yang Zhang

Published: 4.2.2019

Machine learning (ML) has progressed rapidly during the past decade and the major factor that drives such development is the unprecedented large-scale data. As data generation is a continuous process, this leads to ML model owners updating their models frequently with newly-collected data in an online learning scenario. In consequence, if an ML model is queried with the same set of data samples at two different points in time, it will provide different results. In this paper, we investigate whether the change in the output of a black-box ML model before and after being updated can leak information of the dataset used to perform the update, namely the updating set. This constitutes a new attack surface against black-box ML models and such information leakage may compromise the intellectual property and data privacy of the ML model owner. We propose four attacks following an encoder-decoder formulation, which allows inferring diverse information of the updating set. Our new attacks are facilitated by state-of-the-art deep learning techniques. In particular, we propose a hybrid generative model (CBM-GAN) that is based on generative adversarial networks (GANs) but includes a reconstructive loss that allows reconstructing accurate samples. Our experiments show that the proposed attacks achieve strong performance.