Beta-CoRM: A Bayesian Approach for $n$-gram Profiles Analysis

arXiv

AI Security Portal bot

The information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2011.11558

PDF

https://arxiv.org/pdf/2011.11558

Paper information

Authors: José A. Perusquía; Jim E. Griffin; Cristiano Villa
Published: 2020-11-24
Updated: 2024-09-02
Affiliation: Department of Mathematics, Faculty of Sciences, UNAM, Mexico
Country of affiliation: Mexico
Conference:

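Labels estimated by AI

Model performance evaluation; Generative model characteristics; Feature engineering

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".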

Abstract

$n$-gram profiles have been widely and successfully used to analyse long sequences of potentially differing lengths for clustering or classification. Machine learning algorithms have mainly been used for this purpose but, despite their predictive performance, these methods cannot discover hidden structures or provide a full probabilistic representation of the data. A novel class of Bayesian generative models for $n$-gram profiles used as binary attributes has been designed to address this. The flexibility of the proposed modelling allows for a straightforward approach to feature selection within the generative model. Furthermore, a slice sampling algorithm is derived for fast inference; it is applied to synthetic and real data scenarios, and the results show that feature selection can improve classification accuracy.
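
To make the phrase "$n$-gram profiles used as binary attributes" concrete, here is a minimal Python sketch of how sequences of differing lengths might be turned into fixed-length 0/1 vectors, with one entry per distinct $n$-gram. The function names `ngrams` and `binary_profiles` and the toy sequences are assumptions made for illustration and do not come from the paper. The sketch covers only the data representation, not the Beta-CoRM models or the slice sampler.

```python
def ngrams(seq, n):
    """Return the set of contiguous n-grams (as tuples) occurring in seq."""
    # Sequences shorter than n produce an empty set.
    return {tuple(seq[i:i + n]) for i in range(len(seq) - n + 1)}


def binary_profiles(sequences, n):
    """Map sequences of differing lengths to 0/1 vectors over a shared vocabulary.

    Hypothetical helper: entry j of a row is 1 if the j-th n-gram of the
    vocabulary occurs at least once in that sequence, and 0 otherwise.
    """
    per_seq = [ngrams(s, n) for s in sequences]
    vocab = sorted(set().union(*per_seq))  # every n-gram seen in any sequence
    matrix = [[1 if g in grams else 0 for g in vocab] for grams in per_seq]
    return vocab, matrix


# Toy sequences of different lengths (illustrative data only).
sequences = ["abab", "babcab", "cc"]
vocab, X = binary_profiles(sequences, n=2)

for row in X:
    print(row)
```

Each row of `X` has the same length whatever the length of the original sequence, and a binary matrix of this kind is the type of input the abstract describes. Using presence or absence rather than counts discards frequency information, which is the trade-off implied by treating the profiles as binary attributes.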