Machine learning models trained with differentially-private (DP) algorithms
such as DP-SGD enjoy resilience against a wide range of privacy attacks.
Although it is possible to derive bounds for some attacks based solely on an
$(\varepsilon,\delta)$-DP guarantee, meaningful bounds require a small enough
privacy budget (i.e., injecting a large amount of noise), which results in a
large loss in utility. This paper presents a new approach to evaluate the
privacy of machine learning models against specific record-level threats, such
as membership and attribute inference, without the indirection through DP. We
focus on the popular DP-SGD algorithm, and derive simple closed-form bounds.
Our proofs model DP-SGD as an information theoretic channel whose inputs are
the secrets that an attacker wants to infer (e.g., membership of a data record)
and whose outputs are the intermediate model parameters produced by iterative
optimization. We obtain bounds for membership inference that match
state-of-the-art techniques, whilst being orders of magnitude faster to
compute. Additionally, we present a novel data-dependent bound against
attribute inference. Our results provide a direct, interpretable, and practical
way to evaluate the privacy of trained models against specific inference
threats without sacrificing utility.
外部データセット
Adult Census Income
Purchase
参考文献
23rd ACM SIGSAC Conference on Computer and Communications Security, CCS 2016
Deep learning with differential privacy
Martín Abadi, Andy Chu, Ian Goodfellow, H Brendan McMahan, Ilya Mironov, Kunal Talwar, Li Zhang
Published: 2016
International Conference on Machine Learning
Improving the gaussian mechanism for differential privacy: Analytical calibration and optimal denoising
K. Chatzikokolakis, G. Cherubin, C. Palamidessi, C. Troncoso
Published: 2023
arxiv
被引用数 1
Proc. Priv. Enhancing Technol.
Bayes, not Naïve: Security Bounds on Website Fingerprinting Defenses
Giovanni Cherubin
Published: 2017.2.25
Website Fingerprinting (WF) attacks raise major concerns about users'
privacy. They employ Machine Learning (ML) to allow a local passive adversary
to uncover the Web browsing behavior of a user, even if she browses through an
encrypted tunnel (e.g. Tor, VPN). Numerous defenses have been proposed in the
past; however, it is typically difficult to have formal guarantees on their
security, which is most often evaluated empirically against state-of-the-art
attacks. In this paper, we present a practical method to derive security bounds
for any WF defense, which depend on a chosen feature set. This result derives
from reducing WF attacks to an ML classification task, where we can determine
the smallest achievable error (the Bayes error); such error can be estimated in
practice, and is a lower bound for a WF adversary, for any classification
algorithm he may use. Our work has two main consequences: i) it allows
determining the security of WF defenses, in a black-box manner, with respect to
the state-of-the-art feature set and ii) it favors shifting the focus of future
WF research to the identification of optimal feature sets. The generality of
the approach further suggests that the method could be used to define security
bounds for other ML-based attacks.