Research Group on Intelligent Engineering and Computing for Advanced Innovation and Development (GECAD), Porto School of Engineering, Polytechnic of Porto (ISEP-IPP)
These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
Cybersecurity threats highlight the need for robust network intrusion
detection systems to identify malicious behaviour. These systems rely heavily
on large datasets to train machine learning models capable of detecting
patterns and predicting threats. In the past two decades, researchers have
produced a multitude of datasets, however, some widely utilised recent datasets
generated with CICFlowMeter contain inaccuracies. These result in flow
generation and feature extraction inconsistencies, leading to skewed results
and reduced system effectiveness. Other tools in this context lack ease of use,
customizable feature sets, and flow labelling options. In this work, we
introduce HERA, a new open-source tool that generates flow files and labelled
or unlabelled datasets with user-defined features. Validated and tested with
the UNSW-NB15 dataset, HERA demonstrated accurate flow and label generation.