Privacy Preserving Analytics on Distributed Medical Data

TOP 文献データベース Privacy Preserving Analytics on Distributed Medical Data

Computing Research Repository (CoRR)

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1806.06477

PDF

https://arxiv.org/pdf/1806.06477

文献情報

作者: Marina Blanton,Ah Reum Kang,Subhadeep Karan,Jaroslaw Zola
公開日: 2025-3-25
所属機関: Department of Computer Science and Engineering, University at Buffalo
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

差分プライバシープライバシー保護手法データ前処理

Abstract

Objective: To enable privacy-preserving learning of high quality generative and discriminative machine learning models from distributed electronic health records. Methods and Results: We describe general and scalable strategy to build machine learning models in a provably privacy-preserving way. Compared to the standard approaches using, e.g., differential privacy, our method does not require alteration of the input biomedical data, works with completely or partially distributed datasets, and is resilient as long as the majority of the sites participating in data processing are trusted to not collude. We show how the proposed strategy can be applied on distributed medical records to solve the variables assignment problem, the key task in exact feature selection and Bayesian networks learning. Conclusions: Our proposed architecture can be used by health care organizations, spanning providers, insurers, researchers and computational service providers, to build robust and high quality predictive models in cases where distributed data has to be combined without being disclosed, altered or otherwise compromised.