Federated Learning (FL) systems are gaining popularity as a solution for
training Machine Learning (ML) models on large-scale user data collected on
personal devices (e.g., smartphones) without raw data leaving the device.
At the core of FL is a network of anonymous user devices sharing training
information (model parameter updates) computed locally on personal data.
However, the type and degree to which user-specific information is encoded in
the model updates is poorly understood. In this paper, we identify model
updates encode subtle variations in which users capture and generate data. The
variations provide a strong statistical signal, allowing an adversary to
effectively deanonymize participating devices using a limited set of auxiliary
data. We analyze resulting deanonymization attacks on diverse tasks on
real-world (anonymized) user-generated data across a range of closed- and
open-world scenarios. We study various strategies to mitigate the risks of
deanonymization. As random perturbation methods do not offer convincing
operating points, we propose data-augmentation strategies that introduce
adversarial biases into device data and thereby offer substantial protection
against deanonymization threats with little effect on utility.