SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2403.11833

PDF

https://arxiv.org/pdf/2403.11833

Paper Information

Author: Javad Rafiei Asl; Mohammad H. Rafiei; Manar Alohaly; Daniel Takabi
Published: 3-18-2024
Affiliation: Department of Computer Science, Georgia State University
Country: United States of America
Conference: IEEE Trans. Dependable Secur. Comput.

Labels Estimated by AI

Evaluation Method; Dynamic Threshold Calculation; Adversarial Example

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Machine learning models are vulnerable to maliciously crafted Adversarial Examples (AEs). Training a machine learning model with AEs improves its robustness and stability against adversarial attacks. It is essential to develop models that produce high-quality AEs. Developing such models has been much slower in natural language processing (NLP) than in areas such as computer vision. This paper introduces a practical and efficient adversarial attack model called SSCAE, for Semantic, Syntactic, and Context-aware natural language AEs generator. SSCAE identifies important words and uses a masked language model to generate an early set of substitutions. Next, two well-known language models are employed to evaluate the initial set in terms of semantic and syntactic characteristics. We introduce (1) a dynamic threshold to capture more efficient perturbations and (2) a local greedy search to generate high-quality AEs. As a black-box method, SSCAE generates humanly imperceptible and context-aware AEs that preserve semantic consistency and the source language's syntactical and grammatical requirements. The effectiveness and superiority of the proposed SSCAE model are illustrated with fifteen comparative experiments and extensive sensitivity analysis for parameter optimization. SSCAE outperforms the existing models in all experiments while maintaining a higher semantic consistency with a lower query number and a comparable perturbation rate.
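The pipeline outlined in the abstract (rank important words, propose substitutions, filter them with a threshold, then search greedily) can be illustrated with a toy sketch. This is not the authors' implementation. The names `rank_important`, `filter_dynamic`, `attack`, `toy_victim`, and `toy_propose` are hypothetical. The candidate table stands in for a masked language model plus the semantic and syntactic scorers. The mean-based cutoff is only an assumed stand-in for the paper's dynamic threshold. The one-word-at-a-time loop is a simplified greedy search and may differ from the paper's local greedy search.

```python
from typing import Callable, Dict, List, Optional

Scorer = Callable[[List[str]], float]
Proposer = Callable[[List[str], int], Dict[str, float]]


def rank_important(words: List[str], victim: Scorer) -> List[int]:
    """Rank positions by how much deleting the word lowers the victim's score."""
    base = victim(words)
    drops = [(base - victim(words[:i] + words[i + 1:]), i) for i in range(len(words))]
    return [i for _, i in sorted(drops, key=lambda d: -d[0])]


def filter_dynamic(cands: Dict[str, float]) -> List[str]:
    """Keep candidates scoring at or above the mean of this word's candidates.

    Assumption: the mean is an illustrative cutoff, not the paper's formula.
    """
    if not cands:
        return []
    threshold = sum(cands.values()) / len(cands)
    return [w for w, s in cands.items() if s >= threshold]


def attack(words: List[str], victim: Scorer, propose: Proposer,
           max_changes: int = 2) -> Optional[List[str]]:
    """Greedy black-box substitution: only the victim's output score is queried."""
    adv = list(words)
    for i in rank_important(words, victim)[:max_changes]:
        options = filter_dynamic(propose(adv, i))
        if not options:
            continue
        # Pick the surviving candidate that lowers the victim's score the most.
        best = min(options, key=lambda w: victim(adv[:i] + [w] + adv[i + 1:]))
        if victim(adv[:i] + [best] + adv[i + 1:]) < victim(adv):
            adv[i] = best
        if victim(adv) < 0.5:  # predicted label flipped
            return adv
    return None


# Toy stand-ins (assumptions): a lexicon "classifier" and a fixed candidate table.
POSITIVE = {"great": 1.0, "good": 0.6, "decent": 0.3}
CANDIDATES = {"great": {"good": 0.9, "decent": 0.8, "terrible": 0.1}}


def toy_victim(words: List[str]) -> float:
    """Pretend probability of the 'positive' class."""
    return min(1.0, sum(POSITIVE.get(w, 0.0) for w in words))


def toy_propose(words: List[str], i: int) -> Dict[str, float]:
    """Pretend MLM output: candidate -> combined semantic/syntactic score."""
    return CANDIDATES.get(words[i], {})


adv = attack("the movie was great".split(), toy_victim, toy_propose)
print(adv)
```

In this toy run, "terrible" falls below the cutoff because its similarity score is low, so it is never tried even though it would flip the label. That mirrors the abstract's emphasis on preserving semantic consistency rather than merely fooling the model.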

External Datasets

YELP Polarity Review

IMDB Review

Rotten Tomatoes Movie Reviews

Stanford Sentiment Treebank Version 2

Stanford NLI

Multi-NLI (MNLI-Matched)

Multi-NLI (MNLI-Mismatched)