These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
This paper assesses the performance of five machine learning classifiers:
Decision Tree, Naive Bayes, LightGBM, Logistic Regression, and Random Forest
using latent representations learned by a Variational Autoencoder from malware
datasets. Results from the experiments conducted on different training-test
splits with different random seeds reveal that all the models perform well in
detecting malware with ensemble methods (LightGBM and Random Forest) performing
slightly better than the rest. In addition, the use of latent features reduces
the computational cost of the model and the need for extensive hyperparameter
tuning for improved efficiency of the model for deployment. Statistical tests
show that these improvements are significant, and thus, the practical relevance
of integrating latent space representation with traditional classifiers for
effective malware detection in cybersecurity is established.