Abstract
Recent research has shown that structured machine learning models such as
tree ensembles are vulnerable to privacy attacks targeting their training data.
To mitigate these risks, differential privacy (DP) has become a widely adopted
countermeasure, as it offers rigorous privacy protection. In this paper, we
introduce a reconstruction attack targeting state-of-the-art $\epsilon$-DP
random forests. By leveraging a constraint programming model that incorporates
knowledge of the forest's structure and DP mechanism characteristics, our
approach formally reconstructs the most likely dataset that could have produced
a given forest. Through extensive computational experiments, we examine the
interplay between model utility, privacy guarantees and reconstruction accuracy
across various configurations. Our results reveal that random forests trained
with meaningful DP guarantees can still leak portions of their training data.
Specifically, while DP reduces the success of reconstruction attacks, the only
forests fully robust to our attack exhibit predictive performance no better
than a constant classifier. Building on these insights, we also provide
practical recommendations for constructing DP random forests that are
more resilient to reconstruction attacks while maintaining non-trivial
predictive performance.
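For context on the kind of DP mechanism such attacks can exploit: DP decision-tree learners commonly release per-class leaf counts perturbed with Laplace noise. The sketch below is illustrative only and is not the paper's specific mechanism or attack; the function name and interface are hypothetical.

```python
import numpy as np

def laplace_noisy_counts(counts, epsilon, rng=None):
    """Release epsilon-DP per-class counts for one leaf.

    Each training record falls in exactly one leaf and contributes 1
    to a single class count, so the L1 sensitivity of the count vector
    is 1 and adding Laplace(1/epsilon) noise satisfies epsilon-DP.
    """
    rng = np.random.default_rng() if rng is None else rng
    counts = np.asarray(counts, dtype=float)
    noise = rng.laplace(loc=0.0, scale=1.0 / epsilon, size=counts.shape)
    return counts + noise

# A leaf holding 30 class-0 and 5 class-1 training records, epsilon = 1:
noisy = laplace_noisy_counts([30, 5], epsilon=1.0)
```

Smaller epsilon means larger noise scale and stronger privacy, but noisier counts degrade the leaf's majority vote; this is the utility/privacy trade-off the experiments quantify against reconstruction accuracy.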