Differentially Private Shapley Values for Data Evaluation


Computing Research Repository (CoRR)

AI Security Portal bot

The information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2206.00511

PDF

https://arxiv.org/pdf/2206.00511

Bibliographic Information

Authors: Lauren Watson; Rayna Andreeva; Hao-Tsung Yang; Rik Sarkar
Published: 2022-06-01
Affiliation: School of Informatics, University of Edinburgh
Country of affiliation: United Kingdom
Venue: Computing Research Repository (CoRR)

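Labels Estimated by AI

Privacy evaluation; Loss term; Sample complexity

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".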

Abstract

The Shapley value has been proposed as a solution to many applications in machine learning, including for equitable valuation of data. Shapley values are computationally expensive and involve the entire dataset. The query for a point's Shapley value can also compromise the statistical privacy of other data points. We observe that in machine learning problems such as empirical risk minimization, and in many learning algorithms (such as those with uniform stability), a diminishing returns property holds, where marginal benefit per data point decreases rapidly with data sample size. Based on this property, we propose a new stratified approximation method called the Layered Shapley Algorithm. We prove that this method operates on small ($O(\mathrm{polylog}(n))$) random samples of data and small sized ($O(\log n)$) coalitions to achieve the results with guaranteed probabilistic accuracy, and can be modified to incorporate differential privacy. Experimental results show that the algorithm correctly identifies high-value data points that improve validation accuracy, and that the differentially private evaluations preserve approximate ranking of data.
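To make the idea of stratifying by coalition size concrete, here is a minimal Python sketch. It is not the authors' Layered Shapley Algorithm, whose details the abstract does not give. It only illustrates the standard identity that a Shapley value is the average, over coalition sizes k = 0..n-1, of the expected marginal contribution to a random coalition of size k. The function name `stratified_shapley`, the parameters `max_size` and `samples_per_layer`, and the toy utility are all assumptions made for illustration. Skipping layers above `max_size` treats their marginal contributions as zero, which is reasonable only when a diminishing returns property holds. No differential privacy mechanism is included.

```python
import random


def stratified_shapley(i, n, utility, max_size, samples_per_layer, rng):
    """Estimate the Shapley value of point i among points 0..n-1.

    Layer k averages the marginal contribution of i over random coalitions
    of size k that exclude i. Layers with k > max_size are skipped, which
    treats their contribution as zero (an assumption, not a guarantee).
    """
    others = [j for j in range(n) if j != i]
    total = 0.0
    for k in range(min(max_size, n - 1) + 1):
        layer_sum = 0.0
        for _ in range(samples_per_layer):
            coalition = frozenset(rng.sample(others, k))
            layer_sum += utility(coalition | {i}) - utility(coalition)
        total += layer_sum / samples_per_layer
    # Each of the n coalition sizes carries weight 1/n in the Shapley value.
    return total / n


def toy_utility(coalition):
    # Hypothetical utility with diminishing returns; depends only on size.
    return 1.0 - 1.0 / (1 + len(coalition))


rng = random.Random(0)
full = stratified_shapley(0, 8, toy_utility, max_size=7, samples_per_layer=5, rng=rng)
truncated = stratified_shapley(0, 8, toy_utility, max_size=3, samples_per_layer=5, rng=rng)
print(full, truncated)
```

With this toy utility every point is symmetric, so the untruncated estimate equals the exact Shapley value U(N)/n. The truncated estimate uses only coalitions of size at most 3 and is slightly smaller, which shows the bias that truncation introduces when the remaining marginal contributions are small but nonzero.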