SoK: Data Reconstruction Attacks Against Machine Learning Models: Definition, Metrics, and Benchmark

TOP 文献データベース SoK: Data Reconstruction Attacks Against Machine Learning Models: Definition, Metrics, and Benchmark

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2506.07888

PDF

https://arxiv.org/pdf/2506.07888

文献情報

作者: Rui Wen,Yiyong Liu,Michael Backes,Yang Zhang
公開日: 2025-6-11
所属機関: Institute of Science Tokyo
所属の国: Japan
会議名

AIにより推定されたラベル

モデルDoS 再構成アルゴリズム評価メトリクス

Abstract

Data reconstruction attacks, which aim to recover the training dataset of a target model with limited access, have gained increasing attention in recent years. However, there is currently no consensus on a formal definition of data reconstruction attacks or appropriate evaluation metrics for measuring their quality. This lack of rigorous definitions and universal metrics has hindered further advancement in this field. In this paper, we address this issue in the vision domain by proposing a unified attack taxonomy and formal definitions of data reconstruction attacks. We first propose a set of quantitative evaluation metrics that consider important criteria such as quantifiability, consistency, precision, and diversity. Additionally, we leverage large language models (LLMs) as a substitute for human judgment, enabling visual evaluation with an emphasis on high-quality reconstructions. Using our proposed taxonomy and metrics, we present a unified framework for systematically evaluating the strengths and limitations of existing attacks and establishing a benchmark for future research. Empirical results, primarily from a memorization perspective, not only validate the effectiveness of our metrics but also offer valuable insights for designing new attacks.