Abstract
The training phase of machine learning models is a delicate step, especially
in cybersecurity contexts. Recent research has surfaced a series of insidious
training-time attacks that inject backdoors into models designed for security
classification tasks without altering the training labels. With this work, we
propose new techniques that leverage insights into cybersecurity threat models to
effectively mitigate these clean-label poisoning attacks while preserving
model utility. By performing density-based clustering on a carefully chosen
feature subspace, and progressively isolating the suspicious clusters through a
novel iterative scoring procedure, our defensive mechanism can mitigate the
attacks without requiring many of the common assumptions in the existing
backdoor defense literature. To show the generality of our proposed mitigation,
we evaluate it on two clean-label model-agnostic attacks on two different
classic cybersecurity data modalities: network flow classification and malware
classification, using gradient boosting and neural network models.
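To make the described pipeline concrete, below is a minimal illustrative sketch, not the paper's implementation: it assumes DBSCAN as the density-based clustering algorithm and substitutes a simple centroid-distance criterion for the paper's novel iterative scoring procedure. The function name, the feature_idx parameter, and the n_rounds budget are all hypothetical.

```python
import numpy as np
from sklearn.cluster import DBSCAN


def filter_poisoned_samples(X, y, feature_idx, n_rounds=5,
                            eps=0.5, min_samples=10):
    """Iteratively quarantine the most suspicious density cluster.

    X: (n_samples, n_features) training matrix; y: integer labels;
    feature_idx: indices of the feature subspace used for clustering.
    Returns a boolean mask over the rows of X marking samples to keep.
    """
    keep = np.ones(len(X), dtype=bool)
    for _ in range(n_rounds):
        idx = np.where(keep)[0]
        sub = X[idx][:, feature_idx]  # restrict to the chosen subspace
        cluster_ids = DBSCAN(eps=eps, min_samples=min_samples).fit_predict(sub)

        worst_cluster, worst_score = None, -np.inf
        for c in set(cluster_ids) - {-1}:  # -1 is DBSCAN's noise label
            members = idx[cluster_ids == c]
            # Illustrative score: how far the cluster centroid sits from the
            # centroid of all kept points sharing its majority label. A dense
            # cluster far from its class mass is treated as suspicious.
            label = np.bincount(y[members]).argmax()
            class_pts = X[keep & (y == label)][:, feature_idx]
            score = np.linalg.norm(X[members][:, feature_idx].mean(axis=0)
                                   - class_pts.mean(axis=0))
            if score > worst_score:
                worst_cluster, worst_score = c, score

        if worst_cluster is None:  # no clusters found; stop early
            break
        keep[idx[cluster_ids == worst_cluster]] = False  # quarantine cluster
    return keep


# Usage: retrain the classifier on the filtered set, e.g.
# mask = filter_poisoned_samples(X_train, y_train, feature_idx=[0, 3, 7])
# model.fit(X_train[mask], y_train[mask])
```

Because the filtering operates on the training data rather than on model internals, the same sketch applies unchanged whether the downstream classifier is a gradient boosting model or a neural network, which is the model-agnostic property the abstract claims.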