Hard-Label Cryptanalytic Extraction of Neural Network Models

TOP Literature Database Hard-Label Cryptanalytic Extraction of Neural Network Models

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2409.11646

PDF

https://arxiv.org/pdf/2409.11646

Paper Information

Author: Yi Chen;Xiaoyang Dong;Jian Guo;Yantian Shen;Anyu Wang;Xiaoyun Wang
Published: 9-18-2024
Affiliation: Institute for Advanced Study, Tsinghua University
Country: China
Conference

Labels Estimated by AI

Model Extraction Attack Attack Method Computational Complexity

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

The machine learning problem of extracting neural network parameters has been proposed for nearly three decades. Functionally equivalent extraction is a crucial goal for research on this problem. When the adversary has access to the raw output of neural networks, various attacks, including those presented at CRYPTO 2020 and EUROCRYPT 2024, have successfully achieved this goal. However, this goal is not achieved when neural networks operate under a hard-label setting where the raw output is inaccessible. In this paper, we propose the first attack that theoretically achieves functionally equivalent extraction under the hard-label setting, which applies to ReLU neural networks. The effectiveness of our attack is validated through practical experiments on a wide range of ReLU neural networks, including neural networks trained on two real benchmarking datasets (MNIST, CIFAR10) widely used in computer vision. For a neural network consisting of $10^5$ parameters, our attack only requires several hours on a single core.

External Datasets

MNIST

CIFAR10