PPaaS: Privacy Preservation as a Service

TOP 文献データベース PPaaS: Privacy Preservation as a Service

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2007.02013

PDF

https://arxiv.org/pdf/2007.02013

文献情報

作者: Pathum Chamikara Mahawaga Arachchige;Peter Bertok;Ibrahim Khalil;Dongxi Liu;Seyit Camtepe
公開日: 2020-7-4
更新日: 2021-4-21
所属機関: RMIT University
所属の国: Australia
会議名

AIにより推定されたラベル

プライバシー評価データの隠蔽 PPaaSのデータサニタイズ

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Personally identifiable information (PII) can find its way into cyberspace through various channels, and many potential sources can leak such information. Data sharing (e.g. cross-agency data sharing) for machine learning and analytics is one of the important components in data science. However, due to privacy concerns, data should be enforced with strong privacy guarantees before sharing. Different privacy-preserving approaches were developed for privacy preserving data sharing; however, identifying the best privacy-preservation approach for the privacy-preservation of a certain dataset is still a challenge. Different parameters can influence the efficacy of the process, such as the characteristics of the input dataset, the strength of the privacy-preservation approach, and the expected level of utility of the resulting dataset (on the corresponding data mining application such as classification). This paper presents a framework named \underline{P}rivacy \underline{P}reservation \underline{a}s \underline{a} \underline{S}ervice (PPaaS) to reduce this complexity. The proposed method employs selective privacy preservation via data perturbation and looks at different dynamics that can influence the quality of the privacy preservation of a dataset. PPaaS includes pools of data perturbation methods, and for each application and the input dataset, PPaaS selects the most suitable data perturbation approach after rigorous evaluation. It enhances the usability of privacy-preserving methods within its pool; it is a generic platform that can be used to sanitize big data in a granular, application-specific manner by employing a suitable combination of diverse privacy-preserving algorithms to provide a proper balance between privacy and utility.

外部データセット

Wholesale customers

Wine Quality

Page Blocks Classification

Letter Recognition

Statlog (Shuttle)

HEPMASS

HIGGS