Membership Inference Attacks From First Principles


arXiv

AI Security Portal bot

The information in this literature database is collected automatically.

Source

https://arxiv.org/abs/2112.03570

PDF

https://arxiv.org/pdf/2112.03570

Paper information

Authors: Nicholas Carlini; Steve Chien; Milad Nasr; Shuang Song; Andreas Terzis; Florian Tramer
Published: 2021-12-7
Updated: 2022-4-13
Affiliation: Google Research
Country of affiliation: United States of America
Conference: SP

Labels estimated by AI

Performance evaluation metrics; Membership inference; Privacy risk management

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the literature database".

Abstract

A membership inference attack allows an adversary to query a trained machine learning model to predict whether or not a particular example was contained in the model's training dataset. These attacks are currently evaluated using average-case "accuracy" metrics that fail to characterize whether the attack can confidently identify any members of the training set. We argue that attacks should instead be evaluated by computing their true-positive rate at low (e.g., <0.1%) false-positive rates, and find most prior attacks perform poorly when evaluated in this way. To address this we develop a Likelihood Ratio Attack (LiRA) that carefully combines multiple ideas from the literature. Our attack is 10x more powerful at low false-positive rates, and also strictly dominates prior attacks on existing metrics.
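
The evaluation metric the abstract argues for, true-positive rate at a fixed low false-positive rate, can be illustrated with a short sketch. This is not the authors' code: the function name `tpr_at_fpr`, the convention that a higher score means "more likely a member", and the synthetic Gaussian scores in the demo are all assumptions made for illustration only.

```python
import numpy as np


def tpr_at_fpr(scores, is_member, target_fpr=0.001):
    """Return (tpr, fpr, threshold) for a threshold attack.

    Hypothetical helper: an example is predicted "member" when its score is
    strictly greater than the threshold. The threshold is chosen from the
    non-member scores so that the achieved FPR never exceeds target_fpr.
    """
    scores = np.asarray(scores, dtype=float)
    is_member = np.asarray(is_member, dtype=bool)

    member_scores = scores[is_member]
    non_member_scores = np.sort(scores[~is_member])[::-1]  # descending order

    # Maximum number of non-members we may flag without exceeding target_fpr.
    k = int(np.floor(target_fpr * non_member_scores.size))

    # Using the (k+1)-th largest non-member score as a strict threshold means
    # at most k non-members lie above it, even when scores are tied.
    if k < non_member_scores.size:
        threshold = non_member_scores[k]
    else:
        threshold = -np.inf  # target_fpr of 1.0: everything is flagged

    tpr = float(np.mean(member_scores > threshold))
    fpr = float(np.mean(non_member_scores > threshold))
    return tpr, fpr, float(threshold)


# Demo on synthetic attack scores (assumed data, not results from the paper).
rng = np.random.default_rng(0)
demo_scores = np.concatenate([
    rng.normal(1.0, 1.0, 10_000),  # pretend scores for training members
    rng.normal(0.0, 1.0, 10_000),  # pretend scores for non-members
])
demo_labels = np.concatenate([np.ones(10_000), np.zeros(10_000)])

demo_tpr, demo_fpr, _ = tpr_at_fpr(demo_scores, demo_labels, target_fpr=0.001)
print(f"TPR = {demo_tpr:.4f} at FPR = {demo_fpr:.4f}")
```

Reporting this value at several small FPR targets (or plotting the ROC curve on log-log axes) shows whether an attack can confidently identify any members, which a single average-case accuracy figure does not reveal.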