TextHide: Tackling Data Privacy in Language Understanding Tasks

TOP 文献データベース TextHide: Tackling Data Privacy in Language Understanding Tasks

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2010.06053

PDF

https://arxiv.org/pdf/2010.06053

文献情報

作者: Yangsibo Huang,Zhao Song,Danqi Chen,Kai Li,Sanjeev Arora
公開日: 2020-10-13
所属機関: Princeton University
所属の国: United States of America
会議名

AIにより推定されたラベル

メンバーシップ推論自然言語処理情報理論的プライバシー

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

An unsolved challenge in distributed or federated learning is to effectively mitigate privacy risks without slowing down training or reducing accuracy. In this paper, we propose TextHide aiming at addressing this challenge for natural language understanding tasks. It requires all participants to add a simple encryption step to prevent an eavesdropping attacker from recovering private text data. Such an encryption step is efficient and only affects the task performance slightly. In addition, TextHide fits well with the popular framework of fine-tuning pre-trained language models (e.g., BERT) for any sentence or sentence-pair task. We evaluate TextHide on the GLUE benchmark, and our experiments show that TextHide can effectively defend attacks on shared gradients or representations and the averaged accuracy reduction is only $1.9\%$. We also present an analysis of the security of TextHide using a conjecture about the computational intractability of a mathematical problem. Our code is available at https://github.com/Hazelsuko07/TextHide