Elliptical modeling and pattern analysis for perturbation models and classfication


arXiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1710.07939

PDF

https://arxiv.org/pdf/1710.07939

Bibliographic information

Authors: Shan Suthaharan, Weining Shen
Published: 2017-10-22
Affiliation: Department of Computer Science, University of North Carolina at Greensboro
Country of affiliation: United States of America
Venue: Int. J. Data Sci. Anal.

Labels estimated by AI

Privacy protection, machine learning, model evaluation methods, data privacy evaluation

* These labels were added automatically by AI, so they may not be accurate.
For details, see "About the literature database".

Abstract

The characteristics (or numerical patterns) of a feature vector in the transform domain of a perturbation model differ significantly from those of its corresponding feature vector in the input domain. These differences, caused by the perturbation techniques used to transform the feature patterns, degrade the performance of machine learning techniques in the transform domain. In this paper, we proposed a nonlinear parametric perturbation model that transforms the input feature patterns into a set of elliptical patterns, and studied the performance degradation issues associated with the random forest classification technique using both the input and transform domain features. Compared with linear transformations such as Principal Component Analysis (PCA), the proposed method requires fewer statistical assumptions and is highly suitable for applications such as data privacy and security because of the difficulty of inverting the elliptical patterns from the transform domain to the input domain. In addition, we adopted a flexible block-wise dimensionality reduction step in the proposed method to accommodate the high-dimensional data that may arise in modern applications. We evaluated the empirical performance of the proposed method on a network intrusion data set and a biological data set, and compared the results with PCA in terms of classification performance and data privacy protection (measured by the blind source separation attack and signal interference ratio). Both results confirmed the superior performance of the proposed elliptical transformation.
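
The abstract names the signal interference ratio as its privacy measure but does not give a formula. As a rough illustration only, the sketch below uses one common textbook form: the energy of the original signal divided by the energy of the attacker's reconstruction error, in decibels. The function name, the sample values, and the choice of this particular form are assumptions and are not taken from the paper. Under this form, a lower value means the reconstruction is further from the original, which would correspond to stronger privacy protection.

```python
import math


def signal_to_interference_ratio_db(original, recovered):
    """Return 10 * log10(signal energy / residual energy) in decibels.

    `original` is the true feature vector and `recovered` is an attacker's
    estimate of it. This definition is an assumption for illustration; the
    paper may normalise or scale the signals differently.
    """
    if len(original) != len(recovered):
        raise ValueError("original and recovered must have the same length")

    signal_energy = sum(x * x for x in original)
    residual_energy = sum((x - y) ** 2 for x, y in zip(original, recovered))

    if signal_energy == 0:
        raise ValueError("original signal has zero energy")
    if residual_energy == 0:
        # A perfect reconstruction leaves no interference at all.
        return math.inf
    return 10.0 * math.log10(signal_energy / residual_energy)


# Hypothetical values: a close reconstruction and a poor one.
original = [1.0, 2.0, 3.0]
close_estimate = [1.1, 2.1, 3.1]
poor_estimate = [0.0, 0.0, 0.0]

sir_close = signal_to_interference_ratio_db(original, close_estimate)
sir_poor = signal_to_interference_ratio_db(original, poor_estimate)
print(f"close estimate: {sir_close:.2f} dB, poor estimate: {sir_poor:.2f} dB")
```

Blind source separation recovers signals only up to an unknown scale and ordering, so a practical evaluation would typically align the estimate with the original before computing the ratio. That step is omitted here to keep the sketch short.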

External datasets

NSL-KDD

dataset-O

dataset-I

dataset-T

dataset-IR

dataset-TR