Differentially Private Training of Mixture of Experts Models

TOP Literature Database Differentially Private Training of Mixture of Experts Models

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2402.07334

PDF

https://arxiv.org/pdf/2402.07334

Paper Information

Author: Pierre Tholoniat;Huseyin A. Inan;Janardhan Kulkarni;Robert Sim
Published: 2-12-2024
Affiliation: Columbia University
Country: United States of America
Conference: Computing Research Repository (CoRR)

Labels Estimated by AI

LLM Performance Evaluation Privacy Protection Method Model Performance Evaluation

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

This position paper investigates the integration of Differential Privacy (DP) in the training of Mixture of Experts (MoE) models within the field of natural language processing. As Large Language Models (LLMs) scale to billions of parameters, leveraging expansive datasets, they exhibit enhanced linguistic capabilities and emergent abilities. However, this growth raises significant computational and privacy concerns. Our study addresses these issues by exploring the potential of MoE models, known for their computational efficiency, and the application of DP, a standard for privacy preservation. We present the first known attempt to train MoE models under the constraints of DP, addressing the unique challenges posed by their architecture and the complexities of DP integration. Our initial experimental studies demonstrate that MoE models can be effectively trained with DP, achieving performance that is competitive with their non-private counterparts. This initial study aims to provide valuable insights and ignite further research in the domain of privacy-preserving MoE models, softly laying the groundwork for prospective developments in this evolving field.

External Datasets

SST-2

MNLI