Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data

TOP 文献データベース Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2405.14212

PDF

https://arxiv.org/pdf/2405.14212

文献情報

作者: Haoran Li;Xinyuan Zhao;Dadi Guo;Hanlin Gu;Ziqian Zeng;Yuxing Han;Yangqiu Song;Lixin Fan;Qiang Yang
公開日: 2024-5-23
所属機関: The Hong Kong University of Science and Technology
所属の国: China
会議名

AIにより推定されたラベル

データセット生成プライバシー保護手法 Few-Shot Learning

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

As large language models (LLMs) demonstrate unparalleled performance and generalization ability, LLMs are widely used and integrated into various applications. When it comes to sensitive domains, as commonly described in federated learning scenarios, directly using external LLMs on private data is strictly prohibited by stringent data security and privacy regulations. For local clients, the utilization of LLMs to improve the domain-specific small language models (SLMs), characterized by limited computational resources and domain-specific data, has attracted considerable research attention. By observing that LLMs can empower domain-specific SLMs, existing methods predominantly concentrate on leveraging the public data or LLMs to generate more data to transfer knowledge from LLMs to SLMs. However, due to the discrepancies between LLMs' generated data and clients' domain-specific data, these methods cannot yield substantial improvements in the domain-specific tasks. In this paper, we introduce a Federated Domain-specific Knowledge Transfer (FDKT) framework, which enables domain-specific knowledge transfer from LLMs to SLMs while preserving clients' data privacy. The core insight is to leverage LLMs to augment data based on domain-specific few-shot demonstrations, which are synthesized from private domain data using differential privacy. Such synthetic samples share similar data distribution with clients' private data and allow the server LLM to generate particular knowledge to improve clients' SLMs. The extensive experimental results demonstrate that the proposed FDKT framework consistently and greatly improves SLMs' task performance by around 5\% with a privacy budget of less than 10, compared to local training on private data.

外部データセット

Yelp dataset

AGNews

参考文献

Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security

Deep learning with differential privacy

Martín Abadi, Andy Chu, Ian J. Goodfellow, H. B. McMahan, Ilya Mironov, Kunal Talwar, Li Zhang

Published: 2016

Llama 3 model card

AI@Meta

Published: 2024

Slora: Federated parameter efficient fine-tuning of language models