Leveraging VAE-Derived Latent Spaces for Enhanced Malware Detection with Machine Learning Classifiers

TOP Literature Database Leveraging VAE-Derived Latent Spaces for Enhanced Malware Detection with Machine Learning Classifiers

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2503.20803

PDF

https://arxiv.org/pdf/2503.20803

Paper Information

Author: Bamidele Ajayi,Basel Barakat,Ken McGarry
Published: 3-24-2025
Updated: 4-30-2025
Affiliation: School of Computer Science, University of Sunderland
Country: United Kingdom
Conference: Computing Research Repository (CoRR)

Labels Estimated by AI

Malware Classification Machine Learning Technology Factors of Performance Degradation

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

This paper assesses the performance of five machine learning classifiers: Decision Tree, Naive Bayes, LightGBM, Logistic Regression, and Random Forest using latent representations learned by a Variational Autoencoder from malware datasets. Results from the experiments conducted on different training-test splits with different random seeds reveal that all the models perform well in detecting malware with ensemble methods (LightGBM and Random Forest) performing slightly better than the rest. In addition, the use of latent features reduces the computational cost of the model and the need for extensive hyperparameter tuning for improved efficiency of the model for deployment. Statistical tests show that these improvements are significant, and thus, the practical relevance of integrating latent space representation with traditional classifiers for effective malware detection in cybersecurity is established.

External Datasets

EMBER

BODMAS