Raze to the Ground: Query-Efficient Adversarial HTML Attacks on Machine-Learning Phishing Webpage Detectors

Source

https://arxiv.org/abs/2310.03166

PDF

https://arxiv.org/pdf/2310.03166

Paper Information

Authors: Biagio Montaruli, Luca Demetrio, Maura Pintor, Luca Compagna, Davide Balzarotti, Battista Biggio
Published: 2023-10-05
Updated: 2023-10-14
Affiliation: SAP Security Research & EURECOM
Country: France
Conference: AISec@CCS

Labels Estimated by AI

Poisoning Phishing Machine Learning Method

These labels were automatically added by AI and may be inaccurate.

Abstract

Machine-learning phishing webpage detectors (ML-PWD) have been shown to be vulnerable to adversarial manipulations of the HTML code of the input webpage. Nevertheless, recently proposed attacks have demonstrated limited effectiveness because they do not optimize how the adopted manipulations are used and they focus solely on specific elements of the HTML code. In this work, we overcome these limitations by first designing a novel set of fine-grained manipulations that modify the HTML code of the input phishing webpage without compromising its maliciousness or visual appearance, i.e., the manipulations are functionality- and rendering-preserving by design. We then use a query-efficient black-box optimization algorithm to select which manipulations to apply to bypass the target detector. Our experiments show that our attacks can raze to the ground the performance of current state-of-the-art ML-PWD using just 30 queries, thus outperforming the weaker attacks developed in previous work and enabling a much fairer robustness evaluation of ML-PWD.
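
The two-step idea in the abstract (rendering-preserving HTML manipulations, then a query-limited black-box search over them) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: `toy_detector` is a stand-in scorer rather than a real ML-PWD, the two manipulations are hypothetical examples rather than the paper's manipulation set, and plain random search stands in for the paper's optimizer, which this page does not describe. Only the 30-query budget comes from the abstract.

```python
import random
import re
from typing import Callable, List, Tuple

Manipulation = Callable[[str], str]

# Hypothetical rendering-preserving manipulations (not the paper's set).
def inject_hidden_link(html: str) -> str:
    # Adds an internal anchor with the `hidden` attribute, so it is not rendered.
    return html.replace("</body>", '<a href="#" hidden></a></body>', 1)

def inject_comment(html: str) -> str:
    # HTML comments never affect rendering or page behavior.
    return html.replace("</body>", "<!-- padding --></body>", 1)

def toy_detector(html: str) -> float:
    """Stand-in phishing score: the fraction of anchors with external URLs."""
    links = re.findall(r'<a\s[^>]*href="([^"]*)"', html)
    if not links:
        return 1.0
    external = sum(1 for url in links if url.startswith("http"))
    return external / len(links)

def attack(
    html: str,
    score_fn: Callable[[str], float],
    manipulations: List[Manipulation],
    budget: int = 30,        # query budget quoted in the abstract
    threshold: float = 0.5,  # assumed decision threshold of the detector
    seed: int = 0,
) -> Tuple[str, float, int]:
    """Random search: keep a manipulation only if it lowers the score."""
    rng = random.Random(seed)
    best, best_score = html, score_fn(html)
    queries = 1  # the initial scoring call counts as one query
    while queries < budget and best_score >= threshold:
        candidate = rng.choice(manipulations)(best)
        candidate_score = score_fn(candidate)
        queries += 1
        if candidate_score < best_score:
            best, best_score = candidate, candidate_score
    return best, best_score, queries

# Hypothetical phishing page with two external links.
PAGE = (
    '<html><body><a href="http://evil.example/login">Sign in</a>'
    '<a href="http://evil.example/help">Help</a></body></html>'
)
```

Calling `attack(PAGE, toy_detector, [inject_hidden_link, inject_comment])` returns the manipulated page, its final score, and the number of detector queries spent. The original links are left untouched throughout, which is the sense in which the manipulations are functionality-preserving in this toy setting.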

External Datasets

DeltaPhish