Initial Exploration of Zero-Shot Privacy Utility Tradeoffs in Tabular Data Using GPT-4

OpenAI Technical Report

Language models are few-shot learners

T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, D. Amodei

Published: 2020

International Conference on Artificial Intelligence and Statistics

Tabllm: Few-shot classification of tabular data with large language models

S. Hegselmann, A. Buendia, H. Lang, M. Agrawal, X. Jiang, D. Sontag

Published: 2023

Extracting training data from large language models

N. Carlini, F. Tramer, E. Wallace, M. Jagielski, A. Herbert-Voss, K. Lee, A. Roberts, T. Brown, D. Song, U. Erlingsson, A. Oprea, C. Raffel

Published: 2021

arXiv

Beyond memorization: Violating privacy via inference with large language models

R. Staab, M. Vero, M. Balunovic, M. Vechev

Published: 2023

2022 IEEE International Conference on Data Mining Workshops (ICDMW)

Ewtune: A framework for privately fine-tuning large language models with differential privacy

R. Behnia, M. R. Ebrahimi, J. Pacheco, B. Padmanabhan

Published: 2022

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining

Privacy in the time of language models

C. Peris, C. Dupuy, J. Majmudar, R. Parikh, S. Smaili, R. Zemel, R. Gupta

Published: 2023

IEEE Transactions on Signal Processing

Toward information privacy for the internet of things: A nonparametric learning approach

M. Sun, W. P. Tay, X. He

Published: 2018

IEEE Transactions on Information Forensics and Security

On the relationship between inference and data privacy in decentralized iot networks

M. Sun, W. P. Tay

Published: 2020

arxiv

被引用数 1

Computing Research Repository (CoRR)

Scalable Extraction of Training Data from (Production) Language Models

Milad Nasr, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A. Feder Cooper, Daphne Ippolito, Christopher A. Choquette-Choo, Eric Wallace, Florian Tramèr, Katherine Lee

Published: 2023.11.29

This paper studies extractable memorization: training data that an adversary can efficiently extract by querying a machine learning model without prior knowledge of the training dataset. We show an adversary can extract gigabytes of training data from open-source language models like Pythia or GPT-Neo, semi-open models like LLaMA or Falcon, and closed models like ChatGPT. Existing techniques from the literature suffice to attack unaligned models; in order to attack the aligned ChatGPT, we develop a new divergence attack that causes the model to diverge from its chatbot-style generations and emit training data at a rate 150x higher than when behaving properly. Our methods show practical attacks can recover far more data than previously thought, and reveal that current alignment techniques do not eliminate memorization.

プロンプトインジェクショントレーニングデータ抽出手法データ漏洩

Mope: Model perturbation-based privacy attacks on language models

M. Li, J. Wang, J. Wang, S. Neel

Published: 2023

arxiv

被引用数 2

DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models

Xinwei Wu, Junzhuo Li, Minghui Xu, Weilong Dong, Shuangzhi Wu, Chao Bian, Deyi Xiong

Published: 2023.10.31

Large language models pretrained on a huge amount of data capture rich knowledge and information in the training data. The ability of data memorization and regurgitation in pretrained language models, revealed in previous studies, brings the risk of data leakage. In order to effectively reduce these risks, we propose a framework DEPN to Detect and Edit Privacy Neurons in pretrained language models, partially inspired by knowledge neurons and model editing. In DEPN, we introduce a novel method, termed as privacy neuron detector, to locate neurons associated with private information, and then edit these detected privacy neurons by setting their activations to zero. Furthermore, we propose a privacy neuron aggregator dememorize private information in a batch processing manner. Experimental results show that our method can significantly and efficiently reduce the exposure of private data leakage without deteriorating the performance of the model. Additionally, we empirically demonstrate the relationship between model memorization and privacy neurons, from multiple perspectives, including model size, training time, prompts, privacy neuron distribution, illustrating the robustness of our approach.

プライバシー手法モデル編集手法プライバシー保護手法

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

Preserving privacy through dememorization: An unlearning technique for mitigating memorization risks in language models

A. Kassem, O. Mahmoud, S. Saad

Published: 2023

Privacy implications of retrieval-based language models

Y. Huang, S. Gupta, Z. Zhong, K. Li, D. Chen

Published: 2023

Advances in Neural Information Processing Systems

Retiring adult: New datasets for fair machine learning

Frances Ding, Moritz Hardt, John Miller, Ludwig Schmidt

Published: 2021

arXiv

Censoring representations with an adversary

Edwards, H., Storkey, A.

Published: 2015

Entropy

Context-aware generative adversarial privacy

C. Huang, P. Kairouz, X. Chen, L. Sankar, R. Rajagopal

Published: 2017

Learning adversarially fair and transferable representations

D. Madras, E. Creager, T. Pitassi, R. Zemel

Published: 2018

Learning privacy preserving encodings through adversarial training

F. Pittaluga, S. J. Koppal, A. Chakrabarti

Published: 2018

IEEE Transactions on Knowledge and Data Engineering

Deep private-feature extraction

S. A. Osia, A. Taheri, A. S. Shamsabadi, K. Katevas, H. Haddadi, H. R. Rabiee

Published: 2020

Generative adversarial privacy

C. Huang, P. Kairouz, X. Chen, L. Sankar, R. Rajagopal

Published: 2019

Distributed generation of privacy preserving data with user customization

X. Chen, T. Navidi, S. Ermon, R. Rajagopal

Published: 2019

Towards privacy-preserving visual recognition via adversarial training: A pilot study

Z. Wu, Z. Wang, Z. Wang, H. Jin

Published: 2020

Sensitivenets: Learning agnostic representations with application to face images

A. Morales, J. Fierrez, R. Vera-Rodriguez, R. Tolosana

Published: 2020

arxiv

被引用数 1

Adversarial Learning of Privacy-Preserving and Task-Oriented Representations

Taihong Xiao, Yi-Hsuan Tsai, Kihyuk Sohn, Manmohan Chandraker, Ming-Hsuan Yang

Published: 2019.11.23

Data privacy has emerged as an important issue as data-driven deep learning has been an essential component of modern machine learning systems. For instance, there could be a potential privacy risk of machine learning systems via the model inversion attack, whose goal is to reconstruct the input data from the latent representation of deep networks. Our work aims at learning a privacy-preserving and task-oriented representation to defend against such model inversion attacks. Specifically, we propose an adversarial reconstruction learning framework that prevents the latent representations decoded into original input data. By simulating the expected behavior of adversary, our framework is realized by minimizing the negative pixel reconstruction loss or the negative feature reconstruction (i.e., perceptual distance) loss. We validate the proposed method on face attribute prediction, showing that our method allows protecting visual privacy with a small decrease in utility performance. In addition, we show the utility-privacy trade-off with different choices of hyperparameter for negative perceptual distance loss at training, allowing service providers to determine the right level of privacy-protection with a certain utility performance. Moreover, we provide an extensive study with different selections of features, tasks, and the data to further analyze their influence on privacy protection.

ポイズニングプライバシー保護データマイニングメンバーシップ推論

Active privacy-utility tradeoff against a hypothesis testing adversary

E. Erdemir, P. L. Dragotti, D. Gunduz

Published: 2021

Privacy-preserving deep action recognition: An adversarial learning framework and a new dataset

Z. Wu, H. Wang, Z. Wang, H. Jin, Z. Wang

Published: 2021

2022 International Joint Conference on Neural Networks (IJCNN)

Uncertainty-autoencoder-based privacy and utility preserving data type conscious transformation

B. Mandal, G. Amariucai, S. Wei

Published: 2022

ICC 2021 - IEEE International Conference on Communications

A practical approach to navigating the tradeoff between privacy and precise utility

C. Sharma, B. Mandal, G. Amariucai

Published: 2021

Uncertainty-Autoencoder-Based Privacy and Utility Preserving Data Type Conscious Transformation

UCI Machine Learning Repository

Adult

B. Becker, R. Kohavi

Published: 1996

2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Interpreting disparate privacy-utility tradeoff in adversarial learning via attribute correlation

L. Zhang, Y. Chen, A. Li, B. Wang, Y. Chen, F. Li, J. Cao, B. Niu

Published: 2023

Fairness metrics: A comparative analysis

P. Garg, J. Villasenor, V. Foggo

Published: 2020