Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning

arxiv

AI Security Portal bot

The information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1904.01067

PDF

https://arxiv.org/pdf/1904.01067

Bibliographic Information

Authors: Ahmed Salem, Apratim Bhattacharya, Michael Backes, Mario Fritz, Yang Zhang
Published: 2019-04-02
Updated: 2019-11-30
Affiliation: CISPA Helmholtz Center for Information Security
Country of affiliation: Germany
Conference:

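Labels Estimated by AI

Model extraction attack, Reconstruction attack, Adversarial attack detection

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".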

Abstract

Machine learning (ML) has progressed rapidly during the past decade and the major factor that drives such development is the unprecedented large-scale data. As data generation is a continuous process, this leads to ML model owners updating their models frequently with newly-collected data in an online learning scenario. In consequence, if an ML model is queried with the same set of data samples at two different points in time, it will provide different results. In this paper, we investigate whether the change in the output of a black-box ML model before and after being updated can leak information of the dataset used to perform the update, namely the updating set. This constitutes a new attack surface against black-box ML models and such information leakage may compromise the intellectual property and data privacy of the ML model owner. We propose four attacks following an encoder-decoder formulation, which allows inferring diverse information of the updating set. Our new attacks are facilitated by state-of-the-art deep learning techniques. In particular, we propose a hybrid generative model (CBM-GAN) that is based on generative adversarial networks (GANs) but includes a reconstructive loss that allows reconstructing accurate samples. Our experiments show that the proposed attacks achieve strong performance.
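The signal the abstract describes, the change in a black-box model's output on the same queries before and after an update, can be illustrated with a small sketch. This is not the authors' implementation: the `ToyBlackBox` classifier, its single gradient-step `update`, the `posterior_difference` helper, and all sizes are assumptions chosen to keep the example self-contained. It shows only how an attacker with query access might compute the output change that an encoder could then consume.

```python
import numpy as np


def softmax(z):
    # Numerically stable row-wise softmax.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


class ToyBlackBox:
    """Hypothetical stand-in for the target model.

    The attacker is assumed to see only predict_proba(); the weights
    and the update() call belong to the model owner.
    """

    def __init__(self, n_features, n_classes, rng):
        self.W = rng.normal(scale=0.1, size=(n_features, n_classes))

    def predict_proba(self, X):
        return softmax(X @ self.W)

    def update(self, X, y, lr=0.5):
        # One gradient step of softmax cross-entropy on the updating set
        # (an assumed, simplified form of an online-learning update).
        P = self.predict_proba(X)
        Y = np.eye(self.W.shape[1])[y]
        self.W -= lr * X.T @ (P - Y) / len(X)


def posterior_difference(before, after):
    # Flatten the per-sample change in posteriors into one vector.
    return (after - before).ravel()


rng = np.random.default_rng(0)
model = ToyBlackBox(n_features=8, n_classes=3, rng=rng)

# Attacker side: a fixed set of probing samples, queried twice.
probing_set = rng.normal(size=(5, 8))
before = model.predict_proba(probing_set)

# Owner side: the model is updated with data the attacker never sees.
updating_X = rng.normal(size=(10, 8))
updating_y = rng.integers(0, 3, size=10)
model.update(updating_X, updating_y)

after = model.predict_proba(probing_set)
delta = posterior_difference(before, after)
print(delta.shape)  # (15,) = 5 probing samples x 3 classes
```

In this sketch `delta` depends on the updating set only through the model's changed outputs, which is the kind of leakage the paper investigates. How that vector is encoded and decoded into labels or reconstructed samples is described in the paper itself and is not reproduced here.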