Abstract
Though pre-trained encoders can be easily accessed online to build downstream
machine learning (ML) services quickly, various attacks have been designed to
compromise the security and privacy of these encoders. While most attacks
target encoders on the upstream side, it remains unknown how an encoder could
be threatened when deployed in a downstream ML service. This paper unveils a
new vulnerability: the Pre-trained Encoder Inference (PEI) attack, which poses
privacy threats to encoders hidden behind downstream ML services. Given only
API access to a targeted downstream service and a set of candidate encoders,
the PEI attack can infer which of the candidates is secretly used by the
targeted service. We evaluate the attack performance of
PEI against real-world encoders on three downstream tasks: image
classification, text classification, and text-to-image generation. Experiments
show that the PEI attack succeeds in revealing the hidden encoder in most cases
and seldom makes mistakes even when the hidden encoder is not in the candidate
set. We also conduct a case study on LLaVA, one of the most recent
vision-language models, to illustrate that the PEI attack can assist other
ML attacks such as adversarial attacks. The code is available at
https://github.com/fshp971/encoder-inference.
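To make the threat model concrete, below is a minimal sketch of what a candidate-scoring attack in this setting could look like. This is not the paper's actual algorithm (that is described in the paper and implemented in the linked repository); the names `query_service` and `candidate_encoder`, the probe-based setup, and the similarity-correlation scoring rule are all illustrative assumptions.

```python
import numpy as np

def pei_score(query_service, candidate_encoder, probe_inputs):
    """Hypothetical scoring rule: how well does the candidate encoder's
    embedding geometry explain the black-box service's behavior?

    - query_service(x): black-box API of the downstream service,
      assumed to return a vector (e.g., class probabilities).
    - candidate_encoder(x): locally hosted candidate encoder,
      assumed to return a 1-D embedding vector.
    - probe_inputs: attacker-chosen inputs sent to both sides.
    """
    # Embed the probes with the candidate encoder.
    emb = np.stack([candidate_encoder(x) for x in probe_inputs])
    # Query the black-box service on the same probes.
    out = np.stack([query_service(x) for x in probe_inputs])

    def cosine_matrix(m):
        # Pairwise cosine similarities between the rows of m.
        m = m / (np.linalg.norm(m, axis=1, keepdims=True) + 1e-12)
        return m @ m.T

    # If the service is built on this encoder, probes that are close in its
    # embedding space should also elicit similar service outputs, so the two
    # similarity structures should be strongly correlated.
    return np.corrcoef(cosine_matrix(emb).ravel(),
                       cosine_matrix(out).ravel())[0, 1]

def infer_encoder(query_service, candidates, probe_inputs, threshold=0.9):
    """Return the best-scoring candidate name, or None if no score clears
    the threshold (the hidden encoder may not be in the candidate set)."""
    scores = {name: pei_score(query_service, enc, probe_inputs)
              for name, enc in candidates.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else None
```

The thresholded abstention in `infer_encoder` mirrors the abstract's claim that the attack "seldom makes mistakes even when the hidden encoder is not in the candidate set": when no candidate explains the service's behavior well, the attacker declines to name one.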