RelaxLoss: Defending Membership Inference Attacks without Losing Utility

TOP 文献データベース RelaxLoss: Defending Membership Inference Attacks without Losing Utility

International Conference on Learning Representations (ICLR)

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2207.05801

PDF

https://arxiv.org/pdf/2207.05801

文献情報

作者: Dingfan Chen;Ning Yu;Mario Fritz
公開日: 2025-3-19
所属機関: CISPA Helmholtz Center for Information Security
所属の国: Germany
会議名: International Conference on Learning Representations (ICLR)

AIにより推定されたラベル

モデル性能評価メンバーシップ推論損失項

Abstract

As a long-term threat to the privacy of training data, membership inference attacks (MIAs) emerge ubiquitously in machine learning models. Existing works evidence strong connection between the distinguishability of the training and testing loss distributions and the model's vulnerability to MIAs. Motivated by existing results, we propose a novel training framework based on a relaxed loss with a more achievable learning target, which leads to narrowed generalization gap and reduced privacy leakage. RelaxLoss is applicable to any classification model with added benefits of easy implementation and negligible overhead. Through extensive evaluations on five datasets with diverse modalities (images, medical data, transaction records), our approach consistently outperforms state-of-the-art defense mechanisms in terms of resilience against MIAs as well as model utility. Our defense is the first that can withstand a wide range of attacks while preserving (or even improving) the target model's utility. Source code is available at https://github.com/DingfanChen/RelaxLoss