Machine learning models have recently been shown to leak sensitive
information about their training data. This information leakage is exposed
through membership and attribute inference attacks. Although many attack
strategies have been proposed, little effort has been made to formalize these
problems. We present a novel formalism, generalizing membership and attribute
inference attack setups previously studied in the literature and connecting
them to memorization and generalization. First, we derive a universal bound on
the success rate of inference attacks and connect it to the generalization gap
of the target model. Second, we study how much sensitive information the
learning algorithm stores about its training set and derive bounds on the
mutual information between the sensitive attributes and the model parameters.
Experimentally, we illustrate the potential of our approach by
applying it to both synthetic data and classification tasks on natural images.
Finally, we apply our formalism to different attribute inference strategies
with which an adversary can recover the identity of writers in the PenDigits
dataset.
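
To make the connection between attack success and the generalization gap concrete, the following is a minimal, self-contained sketch of a standard loss-threshold membership inference attack on a deliberately overfit classifier. It is not the paper's construction or bound: the synthetic data, the RandomForest target model, and the median-loss threshold rule are all illustrative assumptions chosen only to show how a membership advantage can be measured alongside the generalization gap.

```python
# Illustrative loss-threshold membership inference attack (a sketch, not the
# paper's formalism). All dataset sizes, the target model, and the threshold
# rule are assumptions made for the example.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=4000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.5, random_state=0)

# Target model: deliberately overfit so the generalization gap is visible.
model = RandomForestClassifier(n_estimators=50, max_depth=None, random_state=0)
model.fit(X_train, y_train)

# Generalization gap = train accuracy - test accuracy.
gen_gap = model.score(X_train, y_train) - model.score(X_test, y_test)

def per_example_loss(model, X, y):
    """Cross-entropy loss of each example under the target model."""
    proba = np.clip(model.predict_proba(X), 1e-12, 1.0)
    return -np.log(proba[np.arange(len(y)), y])

# Attack: predict "member" when the example's loss falls below a threshold,
# here calibrated as the median loss on held-out (non-member) data.
loss_in = per_example_loss(model, X_train, y_train)
loss_out = per_example_loss(model, X_test, y_test)
threshold = np.median(loss_out)

tpr = np.mean(loss_in < threshold)   # members correctly flagged
fpr = np.mean(loss_out < threshold)  # non-members wrongly flagged
advantage = tpr - fpr                # membership advantage

print(f"generalization gap:   {gen_gap:.3f}")
print(f"membership advantage: {advantage:.3f}")
```

In this toy setup, the more the target model overfits, the larger both the generalization gap and the attack's advantage tend to be, which is the qualitative relationship the abstract's first result formalizes.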