Evaluating Efficacy of Model Stealing Attacks and Defenses on Quantum Neural Networks

arxiv

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2402.11687

PDF

https://arxiv.org/pdf/2402.11687

Paper Information

Authors: Satwik Kundu; Debarshi Kundu; Swaroop Ghosh
Published: 2024-02-19
Affiliation: The Pennsylvania State University
Country: United States of America
Conference: ACM Great Lakes Symposium on VLSI

Labels Estimated by AI

Defense Method

Model Extraction Attack

Dataset Generation

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Cloud hosting of quantum machine learning (QML) models exposes them to a range of vulnerabilities, the most significant of which is the model stealing attack. In this study, we assess the efficacy of such attacks in the realm of quantum computing. We conducted comprehensive experiments on various datasets with multiple QML model architectures. Our findings revealed that model stealing attacks can produce clone models achieving up to $0.9\times$ and $0.99\times$ clone test accuracy when trained using Top-$1$ and Top-$k$ labels, respectively ($k$ = number of classes). To defend against these attacks, we leverage the unique properties of current noisy hardware to perturb the victim model outputs and hinder the attacker's training process. In particular, we propose: 1) hardware variation-induced perturbation (HVIP) and 2) hardware and architecture variation-induced perturbation (HAVIP). Although noise and architectural variability can provide up to $\sim16\%$ output obfuscation, our comprehensive analysis revealed that models cloned under noisy conditions tend to be resilient, suffering little to no performance degradation due to such obfuscations. Despite limited success with our defense techniques, this outcome has led to an important discovery: QML models trained on noisy hardware are naturally resistant to perturbation- or obfuscation-based defenses or attacks.
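
The difference between Top-1 and Top-k labels in the abstract can be illustrated with a minimal classical sketch. Everything here is an assumption made for illustration and is not taken from the paper: the victim outputs are random probability vectors rather than measurements from a QML model, the function names are hypothetical, Gaussian noise stands in for hardware-induced perturbation, and the "obfuscation rate" (fraction of queries whose Top-1 label changes) is only one possible way to quantify output obfuscation.

```python
import numpy as np

def top1_labels(victim_probs):
    """Hard labels: one-hot vector of the victim's most likely class per query."""
    n_classes = victim_probs.shape[1]
    return np.eye(n_classes)[victim_probs.argmax(axis=1)]

def topk_labels(victim_probs):
    """With k equal to the number of classes, the attacker keeps the
    full probability vector returned by the victim (soft labels)."""
    return victim_probs.copy()

def perturb_outputs(victim_probs, scale, rng):
    """Assumed stand-in for hardware noise: add Gaussian noise,
    clip to positive values, and renormalize each row to sum to 1."""
    noise = rng.normal(0.0, scale, victim_probs.shape)
    noisy = np.clip(victim_probs + noise, 1e-9, None)
    return noisy / noisy.sum(axis=1, keepdims=True)

def obfuscation_rate(clean_probs, noisy_probs):
    """Assumed metric: fraction of queries whose Top-1 label changes."""
    changed = clean_probs.argmax(axis=1) != noisy_probs.argmax(axis=1)
    return float(np.mean(changed))

# Hypothetical victim outputs: 8 queries, 4 classes (as in the "-4" datasets).
rng = np.random.default_rng(0)
probs = rng.dirichlet(np.ones(4), size=8)

hard = top1_labels(probs)      # what a Top-1 attacker trains on
soft = topk_labels(probs)      # what a Top-k attacker trains on
noisy = perturb_outputs(probs, scale=0.05, rng=rng)
rate = obfuscation_rate(probs, noisy)
```

A clone model would then be trained on the attacker's queries paired with either `hard` or `soft`; the soft labels carry more information per query, which is consistent with the higher clone accuracy the abstract reports for Top-k labels.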

External Datasets

MNIST-4

Fashion-4

Kuzushiji-4

Letters-4