AIセキュリティポータル K Program
Stealing the Invisible: Unveiling Pre-Trained CNN Models through Adversarial Examples and Timing Side-Channels
Share
Abstract
Machine learning, with its myriad applications, has become an integral component of numerous technological systems. A common practice in this domain is the use of transfer learning, where a pre-trained model's architecture, readily available to the public, is fine-tuned to suit specific tasks. As Machine Learning as a Service (MLaaS) platforms increasingly use pre-trained models in their backends, it's crucial to safeguard these architectures and understand their vulnerabilities. In this work, we present an approach based on the observation that the classification patterns of adversarial images can be used as a means to steal the models. Furthermore, the adversarial image classifications in conjunction with timing side channels can lead to a model stealing method. Our approach, designed for typical user-level access in remote MLaaS environments exploits varying misclassifications of adversarial images across different models to fingerprint several renowned Convolutional Neural Network (CNN) and Vision Transformer (ViT) architectures. We utilize the profiling of remote model inference times to reduce the necessary adversarial images, subsequently decreasing the number of queries required. We have presented our results over 27 pre-trained models of different CNN and ViT architectures using CIFAR-10 dataset and demonstrate a high accuracy of 88.8% while keeping the query budget under 20.
CSI–NN: Reverse engineering of neural network architectures through electromagnetic side channel
L. Batina, S. Bhasin, D. Jap, S. Picek
Published: 2019
Teacher Model Fingerprinting Attacks Against Transfer Learning
Yufei Chen, Chao Shen, Cong Wang, Yang Zhang
Published: 2021.6.24
On reverse engineering neural network implementation on GPU
Ł. Chmielewski, L. Weissbart
Published: 2021
Copycat CNN: Stealing Knowledge by Persuading Confession with Random Non-Labeled Data
Jacson Rodrigues Correia-Silva, Rodrigo F. Berriel, Claudine Badue, Alberto F. de Souza, Thiago Oliveira-Santos
Published: 2018.6.14
Deep residual learning for image recognition
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Published: 2016
Stealing Links from Graph Neural Networks
Xinlei He, Jinyuan Jia, Michael Backes, Neil Zhenqiang Gong, Yang Zhang
Published: 2020.5.5
Reverse engineering convolutional neural networks through side-channel information leaks
W. Hua, Z. Zhang, G. E. Suh
Published: 2018
Densely connected convolutional networks
G. Huang, Z. Liu, L. Van Der Maaten, K. Q. Weinberger
Published: 2017
MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation
Sanjay Kariyappa, Atul Prakash, Moinuddin Qureshi
Published: 2020.5.7
Learning multiple layers of features from tiny images
Alex Krizhevsky, Geoffrey Hinton
Published: 2009
Imagenet classification with deep convolutional neural networks
Alex Krizhevsky, Ilya Sutskever, Geoffrey E Hinton
Published: 2012
Query-Efficient Black-Box Attack by Active Learning
Pengcheng Li, Jinfeng Yi, Lijun Zhang
Published: 2018.9.13
Delving into transferable adversarial examples and black-box attacks
Liu, Y., Chen, X., Liu, C., Song, D.
Published: 2017
Stealing Knowledge from Protected Deep Neural Networks Using Composite Unlabeled Data
Itay Mosafi, Eli David, Nathan S. Netanyahu
Published: 2019.12.9
I Know What You Trained Last Summer: A Survey on Stealing Machine Learning Models and Defences
Daryna Oliynyk, Rudolf Mayer, Andreas Rauber
Published: 2022.6.17
Activethief: Model extraction using active learning and unannotated public data
S. Pal, Y. Gupta, A. Shukla, A. Kanade, S. K. Shevade, V. Ganapathy
Published: 2020
Practical black-box attacks against machine learning
Papernot, N., McDaniel, P., Goodfellow, I., Jha, S., Celik, Z. B., Swami, A.
Published: 2017
Reverse-engineering deep relu networks
David Rolnick, et al.
Published: 2020
Mobilenetv2: Inverted residuals and linear bottlenecks
M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, L.-C. Chen
Published: 2018
Very deep convolutional networks for large-scale image recognition
K. Simonyan, A. Zisserman
Published: 2015
Going deeper with convolutions
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. E. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich
Published: 2015
Rethinking the inception architecture for computer vision
C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, Z. Wojna
Published: 2016
DAWN: Dynamic Adversarial Watermarking of Neural Networks
Sebastian Szyller, Buse Gul Atli, Samuel Marchal, N. Asokan
Published: 2019.6.3
Time to leak: Cross-device timing attack on edge deep learning accelerator
Yoo-Seung Won, Soham Chatterjee, Dirmanto Jap, Shivam Bhasin, Arindam Basu
Published: 2021
Visual transformers: Token-based image representation and processing for computer vision
Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez, Kurt Keutzer, Peter Vajda
Published: 2020
Cache Telepathy: Leveraging Shared Resource Attacks to Learn DNN Architectures
Mengjia Yan, Christopher Fletcher, Josep Torrellas
Published: 2018.8.15
Share