SoK: Pragmatic Assessment of Machine Learning for Network Intrusion Detection

Nature Digital Medicine

Deep learning-enabled medical computer vision

A. Esteva

Published: 2021

IEEE Int. Symp. High-Perf. Comp. Arch.

Machine learning at Facebook: Understanding inference at the edge

C.-J. Wu

Published: 2019

IEEE Transactions on Neural Networks and Learning Systems

A survey of the usages of deep learning for natural language processing

Daniel W Otter, Julian R Medina, Jugal K Kalita

Published: 2020

ICML

Deep Speech 2: End-to-end speech recognition in English and Mandarin

D. Amodei

Published: 2016

Medical Image Analysis

A survey on deep learning in medical image analysis

G. Litjens, et al.

Published: 2017

arxiv

Cited by 1

Survey of Machine Learning Techniques for Malware Analysis

Daniele Ucci, Leonardo Aniello, Roberto Baldoni

Published: 10.23.2017

Coping with malware is getting more and more challenging, given their relentless growth in complexity and volume. One of the most common approaches in literature is using machine learning techniques, to automatically learn models and patterns behind such complexity, and to develop technologies to keep pace with malware evolution. This survey aims at providing an overview on the way machine learning has been used so far in the context of malware analysis in Windows environments, i.e. for the analysis of Portable Executables. We systematize surveyed papers according to their objectives (i.e., the expected output), what information about malware they specifically use (i.e., the features), and what machine learning techniques they employ (i.e., what algorithm is used to process the input and produce the output). We also outline a number of issues and challenges, including those concerning the used datasets, and identify the main current topical trends and how to possibly advance them. In particular, we introduce the novel concept of malware analysis economics, regarding the study of existing trade-offs among key metrics, such as analysis accuracy and economical costs.

Malware Detection Method Dynamic Analysis Analysis of Detection Methods

Artif. Intell. Review

Applicability of machine learning in spam and phishing email filtering: review and approaches

T. Gangavarapu

Published: 2020

arxiv

Cited by 1

USENIX Security Symposium

Dos and Don'ts of Machine Learning in Computer Security

Daniel Arp, Erwin Quiring, Feargus Pendlebury, Alexander Warnecke, Fabio Pierazzi, Christian Wressnegger, Lorenzo Cavallaro, Konrad Rieck

Published: 10.19.2020

With the growing processing power of computing systems and the increasing availability of massive datasets, machine learning algorithms have led to major breakthroughs in many different areas. This development has influenced computer security, spawning a series of work on learning-based security systems, such as for malware detection, vulnerability discovery, and binary code analysis. Despite great potential, machine learning in security is prone to subtle pitfalls that undermine its performance and render learning-based systems potentially unsuitable for security tasks and practical deployment. In this paper, we look at this problem with critical eyes. First, we identify common pitfalls in the design, implementation, and evaluation of learning-based security systems. We conduct a study of 30 papers from top-tier security conferences within the past 10 years, confirming that these pitfalls are widespread in the current security literature. In an empirical analysis, we further demonstrate how individual pitfalls can lead to unrealistic performance and interpretations, obstructing the understanding of the security problem at hand. As a remedy, we propose actionable recommendations to support researchers in avoiding or mitigating the pitfalls where possible. Furthermore, we identify open problems when applying machine learning in security and provide directions for further research.

Dataset evaluation Bias Spurious Correlation

IEEE Int. Conf. Inf. Fusion

Information Security Analysis as Data Fusion

M. De Shon

Published: 2019

SANS

Security Operations Center (SOC)

Published: 2021

31st USENIX Security Symposium (USENIX Security 22)

99% false positives: A qualitative study of SOC analysts’ perspectives on security alarms

Bushra A Alahmadi, Louise Axon, Ivan Martinovic

Published: 2022

ACM Conf. Comput. Commun. Secur.

Anomaly detection of web-based attacks

C. Kruegel, G. Vigna

Published: 2003

RAID

Anagram: A content anomaly detector resistant to mimicry attack

K. Wang

Published: 2006

J. Comp. Virology

Language models for detection of unknown attacks in network traffic

K. Rieck, P. Laskov

Published: 2007

2010 IEEE symposium on security and privacy

Outside the closed world: On using machine learning for network intrusion detection

R. Sommer, V. Paxson

Published: 2010

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition CVPR

Imagenet: A large-scale hierarchical image database

J. Deng, W. Dong, R. Socher, L. Li, K. Li, L. Fei-Fei

Published: 2009

Int. Conf. Parallel Proces.

ImageNet training in minutes

Y. You

Published: 2018

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Deep residual learning for image recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun

Published: 2016

AAAI Conf. Artif. Intell.

Improved knowledge distillation via teacher assistant

S. I. Mirzadeh

Published: 2020

Network and Distributed System Security Symposium (NDSS)

Drebin: Effective and explainable detection of android malware in your pocket

D. Arp, M. Spreitzenbarth, M. Hubner, H. Gascon, K. Rieck

Published: 2014

IEEE transactions on dependable and secure computing

Yes, machine learning can be more secure! a case study on android malware detection

Demontis, A., Melis, M., Biggio, B., Maiorca, D., Arp, D., Rieck, K., Corona, I., Giacinto, G., Roli, F.

Published: 2017

Ann. Comp. Secur. Appl. Conf.

Can we leverage predictive uncertainty to detect dataset shift and adversarial examples in android malware detection?

D. Li

Published: 2021

Proceedings on Privacy Enhancing Technologies

Less is more: A privacy-respecting android malware classifier using federated learning

R. Galvez

Published: 2021

J. Comp. Virology Hacking Tech.

The duplication issue within the DREBIN dataset

P. Irolla, A. Dey

Published: 2018

IEEE Trans. Depend. Sec. Comput.

Eight years of rider measurement in the android malware ecosystem

G. Suarez-Tangil, G. Stringhini

Published: 2020

ACM Transactions on Privacy and Security

A deep dive inside drebin: An explorative analysis beyond android malware detection scores

Nadia Daoudi, Kevin Allix, Tegawende François Bissyandé, Jacques Klein

Published: 2022

IEEE Communications Surveys Tutorials

A detailed investigation and analysis of using machine learning techniques for intrusion detection

P. Mishra, V. Varadharajan, U. Tupakula, E. S. Pilli

Published: 2019

arxiv

Cited by 1

IEEE Trans. Netw. Serv. Manag.

The Cross-evaluation of Machine Learning-based Network Intrusion Detection Systems

Giovanni Apruzzese, Luca Pajola, Mauro Conti

Published: 3.9.2022

Enhancing Network Intrusion Detection Systems (NIDS) with supervised Machine Learning (ML) is tough. ML-NIDS must be trained and evaluated, operations requiring data where benign and malicious samples are clearly labelled. Such labels demand costly expert knowledge, resulting in a lack of real deployments, as well as on papers always relying on the same outdated data. The situation improved recently, as some efforts disclosed their labelled datasets. However, most past works used such datasets just as a 'yet another' testbed, overlooking the added potential provided by such availability. In contrast, we promote using such existing labelled data to cross-evaluate ML-NIDS. Such approach received only limited attention and, due to its complexity, requires a dedicated treatment. We hence propose the first cross-evaluation model. Our model highlights the broader range of realistic use-cases that can be assessed via cross-evaluations, allowing the discovery of still unknown qualities of state-of-the-art ML-NIDS. For instance, their detection surface can be extended--at no additional labelling cost. However, conducting such cross-evaluations is challenging. Hence, we propose the first framework, XeNIDS, for reliable cross-evaluations based on Network Flows. By using XeNIDS on six well-known datasets, we demonstrate the concealed potential, but also the risks, of cross-evaluations of ML-NIDS.

Dataset Generation Generalization Performance Framework

EAI Int. Conf. Big Data Tech.

Netflow datasets for machine learning-based network intrusion detection systems

M. Sarhan

Published: 2021

ICISSp

Toward generating a new intrusion detection dataset and intrusion traffic characterization

Iman Sharafaldin, Arash Habibi Lashkari, Ali A Ghorbani

Published: 2018

Ieee Access

Deep learning approach for intelligent intrusion detection system

R. Vinayakumar, M. Alazab, K. Soman, P. Poornachandran, A. Al-Nemrat, S. Venkatraman

Published: 2019

IEEE T. Netw. Serv. Manag.

A new method for flow-based network intrusion detection using the inverse potts model

C. Pontes

Published: 2021

2021 IEEE Security and Privacy Workshops (SPW)

Troubleshooting an intrusion detection dataset: the cicids2017 case study

G. Engelen, V. Rimmer, W. Joosen

Published: 2021

ACM Workshop on Artificial Intelligence and Security (AISec)

INSOMNIA: towards concept-drift robustness in network intrusion detection

G. Andresini, F. Pendlebury, F. Pierazzi, C. Loglisci, A. Appice, L. Cavallaro

Published: 2021

GitHub

Source-code of this paper (GitHub)

Published: 2023

IEEE Wireless Comm.

Security in mobile ad hoc networks: challenges and solutions

H. Yang

Published: 2004

Journal of Network and Computer Applications

Intrusion detection system: A comprehensive review

H.-J. Liao, C.-H. Richard Lin, Y.-C. Lin, K.-Y. Tung

Published: 2013

IEEE Trans. Emerg. Topics Comp.

LEoNIDS: A low-latency and energy-efficient network-level intrusion detection system

N. Tsikoudis

Published: 2014

RFC

Internet security glossary, version 2

R. Shirey

Published: 2007

IEEE Network

Network intrusion detection

B. Mukherjee

Published: 1994

ACM Comp. Surv.

A survey on data-driven network intrusion detection

D. Chou, M. Jiang

Published: 2021

ACM Conf. Comp. Commun. Secur.

Enabling visual analytics via alert-driven attack graphs

A. Nadeem

Published: 2021

IEEE Comm. Surv. Tut.

Network Intrusion Detection for IoT security based on learning techniques

N. Chaabouni

Published: 2019

Elsevier Comp. Netw.

Spear SIEM: A security information and event management system for the smart grid

P. Radoglou-Grammatikis

Published: 2021

Elsevier Comp. Secur.

Improving SIEM alert metadata aggregation with a novel kill-chain based classification model

B. D. Bryant, H. Saiedian

Published: 2020

Ann. Comp. Secur. Appl. Conf.

Made: Security analytics for enterprise threat detection

A. Oprea

Published: 2018

IEEE Int. Conf. Intell. Secur. Inf.

A user-centric machine learning framework for cyber security operations center

C. Feng

Published: 2017

Cybersecur

Survey of intrusion detection systems: techniques, datasets, and challenges

Khraisat, A., et al.

Published: 2019

Proc. ACM Workshop Artif. Intell. Secur.

Network Anomaly Detection Using Transfer Learning Based on Auto-Encoders Loss Normalization

A. Yehezkel

Published: 2021

IEEE Symp. Series Comp. Intell.

Near-real-time Anomaly Detection in Encrypted Traffic using Machine Learning Techniques

D. Ucci

Published: 2021

Cisco

Cisco IOS NetFlow

Published: 2021

IEEE Communications Surveys & Tutorials

Why are my flows different? a tutorial on flow exporters

G. Vormayr, J. Fabini, T. Zseby

Published: 2020

Int. Conf. Availability, Reliability, Secur.

On the evaluation of sequential machine learning for network intrusion detection

A. Corsini

Published: 2021

ACM Workshop Artif. Intell. Secur.

A Framework for Cluster and Classifier Evaluation in the Absence of Reference Labels

R. J. Joyce

Published: 2021

IEEE Communications Surveys & Tutorials

A survey of data mining and machine learning methods for cyber security intrusion detection

A. L. Buczak, E. Guven

Published: 2016

IEEE Int. Conf. Cyber Conflicts

On the effectiveness of machine and deep learning for cybersecurity

G. Apruzzese

Published: 2018

Applied Sciences

Machine learning and deep learning methods for intrusion detection systems: A survey

Liu, H., Lang, B.

Published: 2019

Computers & Security

A survey of network-based intrusion detection data sets

Ring, M., Wunderlich, S., Scheuring, D., Landes, D., Hotho, A.

Published: 2019

Proc. IEEE Int. Symp. Netw. Comput. Appl.

Identifying malicious hosts involved in periodic communications

G. Apruzzese

Published: 2017

ArXiv

Kitsune: An ensemble of autoencoders for online network intrusion detection

Y. Mirsky, T. Doitshman, Y. Elovici, A. Shabtai

Published: 2018

IEEE Symp. Secur. Privacy

Deepcase: Semi-supervised contextual analysis of security events

T. Van Ede

Published: 2022

Advances in Neural Information Processing Systems

Realistic evaluation of deep semi-supervised learning algorithms

Avital Oliver, Augustus Odena, Colin A Raffel, Ekin Dogus Cubuk, Ian Goodfellow

Published: 2018

NATO CCD COE Publications

Key concepts in cyber security: Towards a common policy and technology context for cyber security norms

C. Vishik

Published: 2016

IEEE Access

Some fundamental cybersecurity concepts

K. S. Wilson, M. A. Kiy

Published: 2014

IEEE IT Professional

Economics of artificial intelligence in cybersecurity

N. Kshetri

Published: 2021

J. Inf. Secur. Appl.

Stakeholder perspectives and requirements on cybersecurity in Europe

S. Fischer-Hubner

Published: 2021

IEEE Comm. Magazine

A security monitoring plane for named data networking deployment

T. Nguyen

Published: 2018

Technical report

Machine Learning in the Age of Cyber AI

Published: 2020

Technical report

Using AI to detect and contain Cyberthreats

Published: 2019

TU Delft – PhD Dissertation

One-class classification: Concept learning in the absence of counter-examples

D. M. J. Tax

Published: 2002

Elsevier Int. J. Critical Infrastructure Protection

The economics of cybersecurity: Principles and policy options

T. Moore

Published: 2010

Digital Threats: Research and Practice (DTRAP)

Modeling realistic adversarial attacks against network intrusion detection systems

G. Apruzzese, M. Andreolini, L. Ferretti, M. Marchetti, M. Colajanni

Published: 2022

ACM Comp. Surv.

Challenges in deploying machine learning: a survey of case studies

A. Paleyes

Published: 2022

IEEE Communications Surveys & Tutorials

Towards the deployment of machine learning solutions in network traffic classification: A systematic survey

F. Pacheco, E. Exposito, M. Gineste, C. Baudoin, J. Aguilar

Published: 2018

arxiv

Cited by 1

Digital Threats: Research and Practice (DTRAP)

The Role of Machine Learning in Cybersecurity

Giovanni Apruzzese, Pavel Laskov, Edgardo Montes de Oca, Wissam Mallouli, Luis Burdalo Rapa, Athanasios Vasileios Grammatopoulos, Fabio Di Franco

Published: 6.20.2022

Machine Learning (ML) represents a pivotal technology for current and future information systems, and many domains already leverage the capabilities of ML. However, deployment of ML in cybersecurity is still at an early stage, revealing a significant discrepancy between research and practice. Such discrepancy has its root cause in the current state-of-the-art, which does not allow to identify the role of ML in cybersecurity. The full potential of ML will never be unleashed unless its pros and cons are understood by a broad audience. This paper is the first attempt to provide a holistic understanding of the role of ML in the entire cybersecurity domain -- to any potential reader with an interest in this topic. We highlight the advantages of ML with respect to human-driven detection methods, as well as the additional tasks that can be addressed by ML in cybersecurity. Moreover, we elucidate various intrinsic problems affecting real ML deployments in cybersecurity. Finally, we present how various stakeholders can contribute to future developments of ML in cybersecurity, which is essential for further progress in this field. Our contributions are complemented with two real case studies describing industrial applications of ML as defense against cyber-threats.

Adversarial Example Role of Machine Learning Issues with Commercial ML Products

USENIX Security Symposium

Transcend: Detecting concept drift in malware classification models

R. Jordaney, K. Sharad, S. K. Dash, Z. Wang, D. Papini, I. Nouretdinov, L. Cavallaro

Published: 2017

Inf. Sci. (Ny)

Adversarial attacks against intrusion detection systems: Taxonomy, solutions and open issues

I. Corona, G. Giacinto, F. Roli

Published: 2013

Int. Req. Eng. Conf. Workshops

Requirements engineering for machine learning: Perspectives from data scientists

A. Vogelsang

Published: 2019

arxiv

Cited by 1

European Symposium on Security and Privacy (EuroS&P)

SoK: The Impact of Unlabelled Data in Cyberthreat Detection

Giovanni Apruzzese, Pavel Laskov, Aliya Tastemirova

Published: 5.18.2022

Machine learning (ML) has become an important paradigm for cyberthreat detection (CTD) in the recent years. A substantial research effort has been invested in the development of specialized algorithms for CTD tasks. From the operational perspective, however, the progress of ML-based CTD is hindered by the difficulty in obtaining the large sets of labelled data to train ML detectors. A potential solution to this problem are semisupervised learning (SsL) methods, which combine small labelled datasets with large amounts of unlabelled data. This paper is aimed at systematization of existing work on SsL for CTD and, in particular, on understanding the utility of unlabelled data in such systems. To this end, we analyze the cost of labelling in various CTD tasks and develop a formal cost model for SsL in this context. Building on this foundation, we formalize a set of requirements for evaluation of SsL methods, which elucidates the contribution of unlabelled data. We review the state-of-the-art and observe that no previous work meets such requirements. To address this problem, we propose a framework for assessing the benefits of unlabelled data in SsL. We showcase an application of this framework by performing the first benchmark evaluation that highlights the tradeoffs of 9 existing SsL methods on 9 public datasets. Our findings verify that, in some cases, unlabelled data provides a small, but statistically significant, performance gain. This paper highlights that SsL in CTD has a lot of room for improvement, which should stimulate future research in this field.

Dataset evaluation Membership Inference Performance Evaluation

IEEE T. Neural Netw. Learn. Syst.

Classification in the presence of label noise: a survey

B. Frenay, M. Verleysen

Published: 2013

MITRE

MITRE CALDERA

Published: 2023

IEEE T. Netw. Serv. Manag.

Network Intrusion Detection and Comparative Analysis using Ensemble Machine Learning and Feature Selection

S. Das

Published: 2021

Int. Workshop Multiple Classifier Syst.

One-and-a-half-class multiple classifier systems for secure learning against evasion attacks at test time

B. Biggio

Published: 2015

J. Supercomput.

An efficient cascaded method for network intrusion detection based on extreme learning machines

Y. Yu

Published: 2018

Computer Networks

Internet of things: A survey on machine learning-based intrusion detection approaches

K. A. Da Costa, J. P. Papa, C. O. Lisboa, R. Munoz, V. H. C. de Albuquerque

Published: 2019

IEEE Access

AI-IDS: Application of deep learning to real-time Web intrusion detection

A. Kim

Published: 2020

arxiv

Cited by 1

Reviewer Integration and Performance Measurement for Malware Detection

Brad Miller, Alex Kantchelian, Michael Carl Tschantz, Sadia Afroz, Rekha Bachwani, Riyaz Faizullabhoy, Ling Huang, Vaishaal Shankar, Tony Wu, George Yiu, Anthony D. Joseph, J. D. Tygar

Published: 10.26.2015

We present and evaluate a large-scale malware detection system integrating machine learning with expert reviewers, treating reviewers as a limited labeling resource. We demonstrate that even in small numbers, reviewers can vastly improve the system's ability to keep pace with evolving threats. We conduct our evaluation on a sample of VirusTotal submissions spanning 2.5 years and containing 1.1 million binaries with 778GB of raw feature data. Without reviewer assistance, we achieve 72% detection at a 0.5% false positive rate, performing comparable to the best vendors on VirusTotal. Given a budget of 80 accurate reviews daily, we improve detection to 89% and are able to detect 42% of malicious binaries undetected upon initial submission to VirusTotal. Additionally, we identify a previously unnoticed temporal inconsistency in the labeling of training datasets. We compare the impact of training labels obtained at the same time training data is first seen with training labels obtained months later. We find that using training labels obtained well after samples appear, and thus unavailable in practice for current training data, inflates measured detection by almost 20 percentage points. We release our cluster-based implementation, as well as a list of all hashes in our evaluation and 3% of our entire dataset.

Model Performance Evaluation Malicious Binary Selection Data Collection

Elsevier Neurocomputing

FP-ELM: An online sequential learning algorithm for dealing with Concept Drift

D. Liu

Published: 2016

arxiv

Cited by 12

Pattern Recognit.

Wild Patterns: Ten Years After the Rise of Adversarial Machine Learning

Battista Biggio, Fabio Roli

Published: 12.9.2017

Learning-based pattern classifiers, including deep networks, have shown impressive performance in several application domains, ranging from computer vision to cybersecurity. However, it has also been shown that adversarial input perturbations carefully crafted either at training or at test time can easily subvert their predictions. The vulnerability of machine learning to such wild patterns (also referred to as adversarial examples), along with the design of suitable countermeasures, have been investigated in the research field of adversarial machine learning. In this work, we provide a thorough overview of the evolution of this research area over the last ten years and beyond, starting from pioneering, earlier work on the security of non-deep learning algorithms up to more recent work aimed to understand the security properties of deep learning algorithms, in the context of computer vision and cybersecurity tasks. We report interesting connections between these apparently-different lines of work, highlighting common misconceptions related to the security evaluation of machine-learning algorithms. We review the main threat models and attacks defined to this end, and discuss the main limitations of current work, along with the corresponding future challenges towards the design of more secure learning algorithms.