Privacy-preserving feature selection: A survey and proposing a new set of protocols

TOP 文献データベース Privacy-preserving feature selection: A survey and proposing a new set of protocols

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2008.07664

PDF

https://arxiv.org/pdf/2008.07664

文献情報

作者: Javad Rahimipour Anaraki,Saeed Samet
公開日: 2020-8-18
所属機関: Institute of Biomedical Engineering, University of Toronto
所属の国: Canada
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

透かし評価プライバシー保護データマイニング評価手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Feature selection is the process of sieving features, in which informative features are separated from the redundant and irrelevant ones. This process plays an important role in machine learning, data mining and bioinformatics. However, traditional feature selection methods are only capable of processing centralized datasets and are not able to satisfy today's distributed data processing needs. These needs require a new category of data processing algorithms called privacy-preserving feature selection, which protects users' data by not revealing any part of the data neither in the intermediate processing nor in the final results. This is vital for the datasets which contain individuals' data, such as medical datasets. Therefore, it is rational to either modify the existing algorithms or propose new ones to not only introduce the capability of being applied to distributed datasets, but also act responsibly in handling users' data by protecting their privacy. In this paper, we will review three privacy-preserving feature selection methods and provide suggestions to improve their performance when any gap is identified. We will also propose a privacy-preserving feature selection method based on the rough set feature selection. The proposed method is capable of processing both horizontally and vertically partitioned datasets in two- and multi-parties scenarios.