Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy

TOP Literature Database Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2505.05922

PDF

https://arxiv.org/pdf/2505.05922

Paper Information

Author: Haoqi Wu,Wei Dai,Li Wang,Qiang Yan
Published: 5-9-2025
Updated: 5-15-2025
Affiliation: TikTok
Country: China
Conference: International Conference on Machine Learning (ICML)

Labels Estimated by AI

Privacy Design Principles Token Identification Method Evaluation Method

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Large Language Models (LLMs) have gained significant popularity due to their remarkable capabilities in text understanding and generation. However, despite their widespread deployment in inference services such as ChatGPT, concerns about the potential leakage of sensitive user data have arisen. Existing solutions primarily rely on privacy-enhancing technologies to mitigate such risks, facing the trade-off among efficiency, privacy, and utility. To narrow this gap, we propose Cape, a context-aware prompt perturbation mechanism based on differential privacy, to enable efficient inference with an improved privacy-utility trade-off. Concretely, we introduce a hybrid utility function that better captures the token similarity. Additionally, we propose a bucketized sampling mechanism to handle large sampling space, which might lead to long-tail phenomenons. Extensive experiments across multiple datasets, along with ablation studies, demonstrate that Cape achieves a better privacy-utility trade-off compared to prior state-of-the-art works.

External Datasets

SST-2

QNLI

Wikitext-103-v1