AI Security Portal K Program
I can't see it but I can Fine-tune it: On Encrypted Fine-tuning of Transformers using Fully Homomorphic Encryption
Abstract
In today's machine learning landscape, fine-tuning pretrained transformer models has emerged as an essential technique, particularly in scenarios where access to task-aligned training data is limited. However, challenges surface when data sharing encounters obstacles due to stringent privacy regulations or user apprehension regarding personal information disclosure. Earlier works based on secure multiparty computation (SMC) and fully homomorphic encryption (FHE) for privacy-preserving machine learning (PPML) focused more on privacy-preserving inference than privacy-preserving training. In response, we introduce BlindTuner, a privacy-preserving fine-tuning system that enables transformer training exclusively on homomorphically encrypted data for image classification. Our extensive experimentation validates BlindTuner's effectiveness by demonstrating comparable accuracy to non-encrypted models. Notably, our findings highlight a substantial speed enhancement of 1.5x to 600x over previous work in this domain.