Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

TOP Literature Database Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2309.11765

PDF

https://arxiv.org/pdf/2309.11765

Paper Information

Author: Xinyu Tang;Richard Shin;Huseyin A. Inan;Andre Manoel;Fatemehsadat Mireshghallah;Zinan Lin;Sivakanth Gopi;Janardhan Kulkarni;Robert Sim
Published: 9-21-2023
Updated: 1-28-2024
Affiliation: Princeton University
Country: United States of America
Conference

Labels Estimated by AI

Privacy Technique Data Generation Data Protection Method

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

We study the problem of in-context learning (ICL) with large language models (LLMs) on private datasets. This scenario poses privacy risks, as LLMs may leak or regurgitate the private examples demonstrated in the prompt. We propose a novel algorithm that generates synthetic few-shot demonstrations from the private dataset with formal differential privacy (DP) guarantees, and show empirically that it can achieve effective ICL. We conduct extensive experiments on standard benchmarks and compare our algorithm with non-private ICL and zero-shot solutions. Our results demonstrate that our algorithm can achieve competitive performance with strong privacy levels. These results open up new possibilities for ICL with privacy protection for a broad range of applications.

External Datasets

AGNews

TREC

DBPedia

MIT Movies trivia10k