AIセキュリティポータル K Program
Initial Exploration of Zero-Shot Privacy Utility Tradeoffs in Tabular Data Using GPT-4
Share
Abstract
We investigate the application of large language models (LLMs), specifically GPT-4, to scenarios involving the tradeoff between privacy and utility in tabular data. Our approach entails prompting GPT-4 by transforming tabular data points into textual format, followed by the inclusion of precise sanitization instructions in a zero-shot manner. The primary objective is to sanitize the tabular data in such a way that it hinders existing machine learning models from accurately inferring private features while allowing models to accurately infer utility-related attributes. We explore various sanitization instructions. Notably, we discover that this relatively simple approach yields performance comparable to more complex adversarial optimization methods used for managing privacy-utility tradeoffs. Furthermore, while the prompts successfully obscure private features from the detection capabilities of existing machine learning models, we observe that this obscuration alone does not necessarily meet a range of fairness metrics. Nevertheless, our research indicates the potential effectiveness of LLMs in adhering to these fairness metrics, with some of our experimental results aligning with those achieved by well-established adversarial optimization techniques.
Language models are few-shot learners
T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, D. Amodei
Published: 2020
Tabllm: Few-shot classification of tabular data with large language models
S. Hegselmann, A. Buendia, H. Lang, M. Agrawal, X. Jiang, D. Sontag
Published: 2023
Extracting training data from large language models
N. Carlini, F. Tramer, E. Wallace, M. Jagielski, A. Herbert-Voss, K. Lee, A. Roberts, T. Brown, D. Song, U. Erlingsson, A. Oprea, C. Raffel
Published: 2021
Ewtune: A framework for privately fine-tuning large language models with differential privacy
R. Behnia, M. R. Ebrahimi, J. Pacheco, B. Padmanabhan
Published: 2022
Toward information privacy for the internet of things: A nonparametric learning approach
M. Sun, W. P. Tay, X. He
Published: 2018
On the relationship between inference and data privacy in decentralized iot networks
M. Sun, W. P. Tay
Published: 2020
Mope: Model perturbation-based privacy attacks on language models
M. Li, J. Wang, J. Wang, S. Neel
Published: 2023
Privacy implications of retrieval-based language models
Y. Huang, S. Gupta, Z. Zhong, K. Li, D. Chen
Published: 2023
Retiring adult: New datasets for fair machine learning
Frances Ding, Moritz Hardt, John Miller, Ludwig Schmidt
Published: 2021
Learning adversarially fair and transferable representations
D. Madras, E. Creager, T. Pitassi, R. Zemel
Published: 2018
Learning privacy preserving encodings through adversarial training
F. Pittaluga, S. J. Koppal, A. Chakrabarti
Published: 2018
Deep private-feature extraction
S. A. Osia, A. Taheri, A. S. Shamsabadi, K. Katevas, H. Haddadi, H. R. Rabiee
Published: 2020
Generative adversarial privacy
C. Huang, P. Kairouz, X. Chen, L. Sankar, R. Rajagopal
Published: 2019
Distributed generation of privacy preserving data with user customization
X. Chen, T. Navidi, S. Ermon, R. Rajagopal
Published: 2019
Towards privacy-preserving visual recognition via adversarial training: A pilot study
Z. Wu, Z. Wang, Z. Wang, H. Jin
Published: 2020
Sensitivenets: Learning agnostic representations with application to face images
A. Morales, J. Fierrez, R. Vera-Rodriguez, R. Tolosana
Published: 2020
Adversarial Learning of Privacy-Preserving and Task-Oriented Representations
Taihong Xiao, Yi-Hsuan Tsai, Kihyuk Sohn, Manmohan Chandraker, Ming-Hsuan Yang
Published: 2019.11.23
Active privacy-utility tradeoff against a hypothesis testing adversary
E. Erdemir, P. L. Dragotti, D. Gunduz
Published: 2021
Privacy-preserving deep action recognition: An adversarial learning framework and a new dataset
Z. Wu, H. Wang, Z. Wang, H. Jin, Z. Wang
Published: 2021
Uncertainty-autoencoder-based privacy and utility preserving data type conscious transformation
B. Mandal, G. Amariucai, S. Wei
Published: 2022
A practical approach to navigating the tradeoff between privacy and precise utility
C. Sharma, B. Mandal, G. Amariucai
Published: 2021
Interpreting disparate privacy-utility tradeoff in adversarial learning via attribute correlation
L. Zhang, Y. Chen, A. Li, B. Wang, Y. Chen, F. Li, J. Cao, B. Niu
Published: 2023
Fairness metrics: A comparative analysis
P. Garg, J. Villasenor, V. Foggo
Published: 2020
Share