AI Security Portal K Program
Explainability-Informed Targeted Malware Misclassification
Abstract
In recent years, there has been a surge in malware attacks across critical infrastructures, requiring further research and development of appropriate response and remediation strategies for malware detection and classification. Several works have used machine learning models to classify malware into categories, and deep neural networks have shown promising results. However, these models have shown their vulnerability to intentionally crafted adversarial attacks, which yield misclassification of a malicious file. Our paper explores such adversarial vulnerabilities of neural network-based malware classification systems in dynamic and online analysis environments. To evaluate our approach, we trained Feed Forward Neural Networks (FFNN) to classify malware categories based on features obtained from dynamic and online analysis environments. We use a state-of-the-art feature attribution method, SHapley Additive exPlanations (SHAP), to inform the adversarial attacker which features carry the greatest importance in the classification decision. Using these explainability-informed features, we perform targeted white-box evasion attacks against the trained classifier with the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD). Our results demonstrate a high evasion rate for some attack instances, showing a clear vulnerability of the malware classifier to such attacks. We offer recommendations for a balanced approach and a benchmark to support much-needed future research into evasion attacks against malware classifiers and the development of more robust and trustworthy solutions.
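The pipeline described above (train a classifier, attribute its decision with SHAP, then restrict a targeted FGSM/PGD perturbation to the most influential features) can be sketched in a minimal form. This is an illustrative assumption, not the paper's implementation: a linear softmax classifier stands in for the FFNN (for a linear model the SHAP values have the exact closed form phi_i = W_i * (x_i - E[x_i])), and all weights and feature vectors are synthetic.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for the paper's FFNN: a linear softmax classifier over
# dynamic-analysis feature vectors. W, b, baseline, and x are synthetic.
n_features, n_classes = 20, 4
W = rng.normal(size=(n_features, n_classes))
b = np.zeros(n_classes)
baseline = rng.normal(size=n_features)        # E[x], SHAP reference point

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def predict(x):
    return softmax(x @ W + b)

def linear_shap(x, cls):
    # Exact SHAP values for a linear model: phi_i = W[i, cls] * (x_i - E[x_i]).
    return W[:, cls] * (x - baseline)

def input_grad(x, target):
    # Gradient of the targeted cross-entropy loss -log p[target] w.r.t. x.
    p = predict(x)
    return W @ (p - np.eye(n_classes)[target])

def targeted_fgsm(x, target, eps, mask):
    # Step *down* the loss toward the target class, perturbing only the
    # explainability-selected features (mask).
    return x - eps * np.sign(input_grad(x, target)) * mask

def targeted_pgd(x, target, eps, alpha, steps, mask):
    x_adv = x.copy()
    for _ in range(steps):
        x_adv = x_adv - alpha * np.sign(input_grad(x_adv, target)) * mask
        x_adv = x + np.clip(x_adv - x, -eps, eps)   # project to L-inf ball
    return x_adv

# --- usage sketch ---
x = rng.normal(size=n_features)
src = int(np.argmax(predict(x)))
target = (src + 1) % n_classes            # pick some other class as the target

# SHAP-informed feature selection: perturb only the k features most
# responsible for the current (source-class) prediction.
k = 8
phi = linear_shap(x, src)
mask = np.zeros(n_features)
mask[np.argsort(-np.abs(phi))[:k]] = 1.0

x_fgsm = targeted_fgsm(x, target, eps=0.5, mask=mask)
x_pgd = targeted_pgd(x, target, eps=0.5, alpha=0.1, steps=40, mask=mask)
```

In the paper's setting the FFNN's SHAP values would come from a SHAP explainer rather than the closed form above, and the gradients from backpropagation, but the masking and projection logic is the same.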
References
Malware detection with deep neural network using process behavior
S. Tobiyama, Y. Yamaguchi, H. Shimada, T. Ikuse, T. Yagi
Published: 2016
Didroid: Android malware classification and characterization using deep image learning
A. Rahali, et al.
Published: 2020
Entroplyzer: Android malware classification and characterization using entropy analysis of dynamic characteristics
D. S. Keyes, B. Li, G. Kaur, A. H. Lashkari, et al.
Published: 2021
Analyzing Machine Learning Approaches for Online Malware Detection in Cloud
Jeffrey C Kimmell, Mahmoud Abdelsalam, Maanak Gupta
Published: 2021
COPYCAT: Practical Adversarial Attacks on Visualization-Based Malware Detection
Aminollah Khormali, Ahmed Abusnaina, Songqing Chen, DaeHun Nyang, Aziz Mohaisen
Published: 2019
Optimization of code caves in malware binaries to evade machine learning detectors
J. Yuste, E. G. Pardo, J. Tapiador
Published: 2022
Exploiting windows pe structure for adversarial malware evasion attacks
K. Aryal, M. Gupta, M. Abdelsalam
Published: 2023
Intra-section code cave injection for adversarial evasion attacks on windows pe malware file
K. Aryal, M. Gupta, M. Abdelsalam, M. Saleh
Published: 2024
Attack and defense of dynamic analysis-based, adversarial neural malware detection models
J. W. Stokes, et al.
Published: 2018
Deceiving portable executable malware classifiers into targeted misclassification with practical adversarial examples
Y. Kucuk, G. Yan
Published: 2020
Mitigating adversarial evasion attacks of ransomware using ensemble learning
U. Ahmed, J. C.-W. Lin, G. Srivastava
Published: 2022
Mitigating malicious adversaries evasion attacks in industrial internet of things
H. Rafiq, et al.
Published: 2023
A Unified Approach to Interpreting Model Predictions
Scott Lundberg, Su-In Lee
Published: 2017
Radar: A real-world dataset for ai powered run-time detection of cyber-attacks
S. Karapoola, N. Singh, C. Rebeiro, K. V.
Published: 2022
Explaining and harnessing adversarial examples
Goodfellow, I. J., Shlens, J., Szegedy, C.
Published: 2015
Towards Deep Learning Models Resistant to Adversarial Attacks
Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu
Published: 2017