Computing the area under the curve (AUC) as a performance measure to compare the
quality of different machine learning models is one of the final steps of many
research projects. Many of these models are trained on privacy-sensitive data,
and several approaches, such as $\epsilon$-differential privacy, federated
machine learning, and cryptography, exist for cases in which the datasets cannot
be shared or pooled at one site for training and/or testing. In this setting,
computing the global AUC is also problematic, since the labels themselves may
contain privacy-sensitive information. Approaches based on
$\epsilon$-differential privacy have been proposed to address this problem, but
to the best of our knowledge, no exact privacy-preserving solution has been
introduced. In this
paper, we propose a solution based on secure multi-party computation (MPC),
called ppAURORA, which privately merges individually sorted lists from multiple
sources to compute the exact AUC, as one could obtain on the pooled original
test samples. With ppAURORA, the
computation of the exact area under precision-recall and receiver operating
characteristic curves is possible even when ties between prediction confidence
values exist. We use ppAURORA to evaluate two different models predicting acute
myeloid leukemia therapy response and heart disease, respectively. We also
assess its scalability via experiments on synthetic data. All these experiments
show that, in the semi-honest adversary setting, ppAURORA efficiently and
privately computes exactly the same AUC for both evaluation metrics as one
would obtain on the pooled test samples in plaintext.