Private Transformer Inference in MLaaS: A Survey

arxiv

Source

https://arxiv.org/abs/2505.10315

PDF

https://arxiv.org/pdf/2505.10315

Paper Information

Author: Yang Li, Xinyu Zhou, Yitong Wang, Liangxin Qian, Jun Zhao
Published: 2025-05-15
Affiliation: Energy Research Institute @ NTU, Interdisciplinary Graduate Programme, Nanyang Technological University
Country: Singapore

Labels Estimated by AI

Encryption Technology, Computational Consistency, Machine Learning

These labels were automatically added by AI and may be inaccurate.

Abstract

Transformer models have revolutionized AI, powering applications like content generation and sentiment analysis. However, their deployment in Machine Learning as a Service (MLaaS) raises significant privacy concerns, primarily due to the centralized processing of sensitive user data. Private Transformer Inference (PTI) offers a solution by utilizing cryptographic techniques such as secure multi-party computation and homomorphic encryption, enabling inference while preserving both user data and model privacy. This paper reviews recent PTI advancements, highlighting state-of-the-art solutions and challenges. We also introduce a structured taxonomy and evaluation framework for PTI, focusing on balancing resource efficiency with privacy and bridging the gap between high-performance inference and data privacy.
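
To make the secure multi-party computation idea mentioned in the abstract concrete, here is a minimal sketch of two-party additive secret sharing applied to a single linear layer. It is an illustration only and is not taken from the paper: the ring size `MOD`, the helper names (`share`, `reconstruct`, `linear`, `private_linear`) and the toy weights are all assumptions. The sketch hides only the user's input from each computing party; the weights are visible to both parties, so protecting the model as well would require further machinery (for example multiplication triples or homomorphic encryption) that is not shown here.

```python
import secrets

# Assumption for this sketch: all arithmetic is done in the ring Z_{2^32}.
MOD = 2 ** 32


def share(x):
    """Split each integer in x into two additive shares modulo MOD."""
    s0 = [secrets.randbelow(MOD) for _ in x]          # uniformly random mask
    s1 = [(xi - r) % MOD for xi, r in zip(x, s0)]     # x - mask (mod MOD)
    return s0, s1


def reconstruct(s0, s1):
    """Recombine two additive shares into the underlying values."""
    return [(a + b) % MOD for a, b in zip(s0, s1)]


def linear(weights, vec):
    """Matrix-vector product modulo MOD; each party runs this on its own share."""
    return [sum(w * v for w, v in zip(row, vec)) % MOD for row in weights]


def private_linear(weights, x):
    """Simulate both parties locally: share x, apply the layer per share, recombine."""
    s0, s1 = share(x)
    y0 = linear(weights, s0)   # computed by party 0, which sees only s0
    y1 = linear(weights, s1)   # computed by party 1, which sees only s1
    return reconstruct(y0, y1)
```

Because a linear map distributes over addition, applying it to each share separately and summing the results gives the same output as applying it to the original input, while each individual share is a uniformly random vector on its own. Non-linear Transformer components such as softmax and GELU do not have this property, which is one reason private inference for Transformers needs more than this toy.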

External Datasets

GLUE

Wikitext-103 V1

CBT-CN

ImageNet

CIFAR

Tiny-ImageNet