Does Prompt-Tuning Language Model Ensure Privacy?

Neural networks are known to be vulnerable to adversarial examples. In this note, we evaluate the two white-box defenses that appeared at CVPR 2018 and find they are ineffective: when applying existing techniques, we can reduce the accuracy of the defended models to 0%.

Certified Robustness Watermark Adversarial attack

OpenAI Technical Report

Language models are few-shot learners

T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, D. Amodei

Published: 2020

arxiv

Cited by 1

USENIX Security Symposium

The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks

Nicholas Carlini, Chang Liu, Úlfar Erlingsson, Jernej Kos, Dawn Song

Published: 2.23.2018

This paper describes a testing methodology for quantitatively assessing the risk that rare or unique training-data sequences are unintentionally memorized by generative sequence models---a common type of machine-learning model. Because such models are sometimes trained on sensitive data (e.g., the text of users' private messages), this methodology can benefit privacy by allowing deep-learning practitioners to select means of training that minimize such memorization. In experiments, we show that unintended memorization is a persistent, hard-to-avoid issue that can have serious consequences. Specifically, for models trained without consideration of memorization, we describe new, efficient procedures that can extract unique, secret sequences, such as credit card numbers. We show that our testing strategy is a practical and easy-to-use first line of defense, e.g., by describing its application to quantitatively limit data exposure in Google's Smart Compose, a commercial text-completion neural network trained on millions of users' email messages.

Differential Privacy Privacy Protection Mechanism Information-Theoretic Evaluation

Preprint

Extracting training data from large language models

Nicholas Carlini, Florian Tramer, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom Brown, Dawn Song, Ulfar Erlingsson, Alina Oprea, Colin Raffel

Published: 2021

Science

Gauging similarity with n-grams: Language-independent categorization of text

Marc Damashek

Published: 1995

Proceedings of NAACL-HLT

Bert: Pre-training of deep bidirectional transformers for language understanding

Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova

Published: 2019

Theory of Cryptography

Calibrating noise to sensitivity in private data analysis

Cynthia Dwork, Frank McSherry, Kobbi Nissim, Adam Smith

Published: 2006

Proceedings of the Fourth Workshop on Privacy in Natural Language Processing

Privacy leakage in text classification a data extraction approach

Adel Elmahdy, Huseyin A. Inan, Robert Sim

Published: 2022

J. Mach. Learn. Res.

Beyond english-centric multilingual machine translation

Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary

Published: 2021

Transactions of the Association for Computational Linguistics

Membership inference attacks on sequence-to-sequence models: Is my data in your machine translation system?

S. Hisamoto, M. Post, K. Duh

Published: 2020

Transactions of the Association for Computational Linguistics

How can we know what language models know?

Zhengbao Jiang, Frank F. Xu, Jun Araki, Graham Neubig

Published: 2020

arXiv

Sample efficient text summarization using a single pre-trained transformer

Urvashi Khandelwal, Kevin Clark, Dan Jurafsky, Lukasz Kaiser

Published: 2019

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

The power of scale for parameter-efficient prompt tuning

Brian Lester, Rami Al-Rfou, Noah Constant

Published: 2021

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension

Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, Luke Zettlemoyer

Published: 2020

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)

Prefix-tuning: Optimizing continuous prompts for generation

Xiang Lisa Li, Percy Liang

Published: 2021

arXiv

Pretrain, prompt, and predict: A systematic survey of prompting methods in natural language processing

Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, Graham Neubig

Published: 2021

Membership inference on word embedding and beyond

Saeed Mahloujifar, Huseyin A Inan, Melissa Chase, Esha Ghosh, Marcello Hasegawa

Published: 2021

arxiv

Cited by 1

Communication-Efficient Learning of Deep Networks from Decentralized Data

H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas

Published: 2.18.2016

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image models can automatically select good photos. However, this rich data is often privacy sensitive, large in quantity, or both, which may preclude logging to the data center and training there using conventional approaches. We advocate an alternative that leaves the training data distributed on the mobile devices, and learns a shared model by aggregating locally-computed updates. We term this decentralized approach Federated Learning. We present a practical method for the federated learning of deep networks based on iterative model averaging, and conduct an extensive empirical evaluation, considering five different model architectures and four datasets. These experiments demonstrate the approach is robust to the unbalanced and non-IID data distributions that are a defining characteristic of this setting. Communication costs are the principal constraint, and we show a reduction in required communication rounds by 10-100x as compared to synchronized stochastic gradient descent.

Deep Learning Method Federated Learning Reduction of Communication Costs

arXiv

Quantifying privacy risks of masked language models using membership inference attacks

Fatemehsadat Mireshghallah, Kartik Goyal, Archit Uniyal, Taylor Berg-Kirkpatrick, Reza Shokri

Published: 2022

Linguistic Data Consortium

Avocado research email collection

Douglas Oard, William Webber, David Kirsch, Sergey Golitsynskiy

Published: 2015

OpenAI blog

Language models are unsupervised multitask learners

A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, I. Sutskever