Abstract
Recent studies have shown that distributed machine learning is vulnerable to gradient inversion attacks, where private training data can be reconstructed by analyzing the gradients shared during training. Previous attacks established that such reconstructions are possible using gradients from all parameters of the entire model. However, we hypothesize that most of the involved modules, or even their sub-modules, are at risk of training data leakage, and we validate such vulnerabilities in various intermediate layers of language models. Our extensive experiments reveal that gradients from a single Transformer layer, or even a single linear component holding only 0.54% of the model's parameters, are susceptible to training data leakage. Additionally, we show that applying differential privacy to gradients during training offers limited protection against this novel vulnerability of data disclosure.
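To make the attack setting concrete, below is a minimal gradient-matching sketch in the spirit of gradient inversion attacks, not the paper's specific method. The toy embedding-plus-linear model, the optimizer settings, and the simplifying assumption that the attacker already knows the labels (label recovery is often handled as a separate step) are all illustrative:

```python
import torch

torch.manual_seed(0)
vocab, dim, seq_len = 100, 16, 4

embed = torch.nn.Embedding(vocab, dim)
head = torch.nn.Linear(dim, vocab)  # stand-in for a single linear component

# Victim side: gradients of one linear component on private tokens are shared.
private_tokens = torch.randint(0, vocab, (seq_len,))
loss = torch.nn.functional.cross_entropy(head(embed(private_tokens)), private_tokens)
shared_grads = torch.autograd.grad(loss, list(head.parameters()))

# Attacker side: optimize dummy inputs so their gradients match the shared ones.
dummy = torch.randn(seq_len, dim, requires_grad=True)
opt = torch.optim.Adam([dummy], lr=0.1)
for _ in range(500):
    opt.zero_grad()
    # Simplification: labels are assumed known to the attacker.
    dummy_loss = torch.nn.functional.cross_entropy(head(dummy), private_tokens)
    grads = torch.autograd.grad(dummy_loss, list(head.parameters()), create_graph=True)
    gap = sum(((g - s) ** 2).sum() for g, s in zip(grads, shared_grads))
    gap.backward()
    opt.step()

# Map the optimized vectors back to the nearest token embeddings.
recovered = torch.cdist(dummy.detach(), embed.weight.detach()).argmin(dim=1)
print("private:", private_tokens.tolist(), "recovered:", recovered.tolist())
```

The key trick is `create_graph=True`: the dummy gradients stay differentiable, so minimizing the gradient gap propagates back to the dummy inputs. Note that only the gradients of a single linear layer are used here, mirroring the abstract's observation that a small sub-module can suffice.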
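The defense discussed above, differential privacy applied to gradients, typically amounts to clipping the gradient norm and adding calibrated Gaussian noise before sharing, as in DP-SGD. The following is a minimal sketch under those assumptions; the `privatize` helper and the `clip_norm` and `sigma` values are hypothetical, and real DP-SGD clips per-example gradients rather than a single aggregate:

```python
import torch

def privatize(grads, clip_norm=1.0, sigma=0.5):
    """Clip the joint gradient to clip_norm, then add Gaussian noise
    with standard deviation sigma * clip_norm (DP-SGD-style perturbation)."""
    total = torch.sqrt(sum((g ** 2).sum() for g in grads))
    scale = (clip_norm / (total + 1e-12)).clamp(max=1.0)
    return [g * scale + sigma * clip_norm * torch.randn_like(g) for g in grads]

# Usage (illustrative): perturb gradients before they leave the client.
# noisy_grads = privatize([p.grad for p in model.parameters()])
```

Per the abstract's finding, perturbation of this kind offers only limited protection against the layer-level leakage described above.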