Leakage of Dataset Properties in Multi-Party Machine Learning

TOP 文献データベース Leakage of Dataset Properties in Multi-Party Machine Learning

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2006.07267

PDF

https://arxiv.org/pdf/2006.07267

文献情報

作者: Wanrong Zhang;Shruti Tople;Olga Ohrimenko
公開日: 2020-6-13
更新日: 2021-6-18
所属機関: Georgia Institute of Technology
所属の国: United States of America
会議名

AIにより推定されたラベル

メンバーシップ推論攻撃タイププライバシー損失分析

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Secure multi-party machine learning allows several parties to build a model on their pooled data to increase utility while not explicitly sharing data with each other. We show that such multi-party computation can cause leakage of global dataset properties between the parties even when parties obtain only black-box access to the final model. In particular, a ``curious'' party can infer the distribution of sensitive attributes in other parties' data with high accuracy. This raises concerns regarding the confidentiality of properties pertaining to the whole dataset as opposed to individual data records. We show that our attack can leak population-level properties in datasets of different types, including tabular, text, and graph data. To understand and measure the source of leakage, we consider several models of correlation between a sensitive attribute and the rest of the data. Using multiple machine learning models, we show that leakage occurs even if the sensitive attribute is not included in the training data and has a low correlation with other attributes or the target variable.