These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
Deep learning has shown incredible potential across a wide array of tasks,
and accompanied by this growth has been an insatiable appetite for data.
However, a large amount of data needed for enabling deep learning is stored on
personal devices, and recent concerns on privacy have further highlighted
challenges for accessing such data. As a result, federated learning (FL) has
emerged as an important privacy-preserving technology that enables
collaborative training of machine learning models without the need to send the
raw, potentially sensitive, data to a central server. However, the fundamental
premise that sending model updates to a server is privacy-preserving only holds
if the updates cannot be "reverse engineered" to infer information about the
private training data. It has been shown under a wide variety of settings that
this privacy premise does not hold.
In this survey paper, we provide a comprehensive literature review of the
different privacy attacks and defense methods in FL. We identify the current
limitations of these attacks and highlight the settings in which the privacy of
an FL client can be broken. We further dissect some of the successful industry
applications of FL and draw lessons for future successful adoption. We survey
the emerging landscape of privacy regulation for FL and conclude with future
directions for taking FL toward the cherished goal of generating accurate
models while preserving the privacy of the data from its participants.
Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security
Model inversion attacks that exploit confidence information and basic countermeasures
Matt Fredrikson, Somesh Jha, Thomas Ristenpart
Published: 2015
Nature Communications
Truly privacy-preserving federated analytics for precision medicine with multiparty homomorphic encryption
David Froelicher, Juan R Troncoso-Pastoriza, Jean Louis Raisaro, Michel A Cuendet, Joao Sa Sousa, Hyunghoon Cho, Bonnie Berger, Jacques Fellay, Jean-Pierre Hubaux
Published: 2021
CCS’18. ACM
Property inference attacks on fully connected neural networks using permutation invariant representations
K. Ganju, Q. Wang, W. Yang, C. A. Gunter, N. Borisov
Do gradient inversion attacks make federated learning unsafe?
Ali Hatamizadeh, Hongxu Yin, Pavlo Molchanov, Andriy Myronenko, Wenqi Li, Prerna Dogra, Andrew Feng, Mona G Flores, Jan Kautz, Daguang Xu
Published: 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
Gradvit: Gradient inversion of vision transformers
Ali Hatamizadeh, Hongxu Yin, Holger R Roth, Wenqi Li, Jan Kautz, Daguang Xu, Pavlo Molchanov
Published: 2022
Journal of Chemical Information and Modeling
Melloddy: Cross-pharma federated learning at unprecedented scale unlocks benefits in qsar without compromising proprietary information
Wouter Heyndrickx, Lewis Mervin, Tobias Morawietz, Noé Sturm, Lukas Friedrich, Adam Zalewski, Anastasia Pentina, Lina Humbeck, Martijn Oldenhof, Ritsuya Niwayama
Published: 2023
arxiv
Cited by 1
Annual ACM Conference on Computer and Communications Security (CCS)
Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning
Briland Hitaj, Giuseppe Ateniese, Fernando Perez-Cruz
Published: 2.24.2017
Deep Learning has recently become hugely popular in machine learning,
providing significant improvements in classification accuracy in the presence
of highly-structured and large databases.
Researchers have also considered privacy implications of deep learning.
Models are typically trained in a centralized manner with all the data being
processed by the same training algorithm. If the data is a collection of users'
private data, including habits, personal pictures, geographical positions,
interests, and more, the centralized server will have access to sensitive
information that could potentially be mishandled. To tackle this problem,
collaborative deep learning models have recently been proposed where parties
locally train their deep learning structures and only share a subset of the
parameters in the attempt to keep their respective training sets private.
Parameters can also be obfuscated via differential privacy (DP) to make
information extraction even more challenging, as proposed by Shokri and
Shmatikov at CCS'15.
Unfortunately, we show that any privacy-preserving collaborative deep
learning is susceptible to a powerful attack that we devise in this paper. In
particular, we show that a distributed, federated, or decentralized deep
learning approach is fundamentally broken and does not protect the training
sets of honest participants. The attack we developed exploits the real-time
nature of the learning process that allows the adversary to train a Generative
Adversarial Network (GAN) that generates prototypical samples of the targeted
training set that was meant to be private (the samples generated by the GAN are
intended to come from the same distribution as the training data).
Interestingly, we show that record-level DP applied to the shared parameters of
the model, as suggested in previous work, is ineffective (i.e., record-level DP
is not designed to address our attack).