Abstract
Textual data is often represented as real-valued embeddings in NLP,
particularly with the popularity of large language models (LLMs) and Embeddings
as a Service (EaaS). However, storing sensitive information as embeddings can
be susceptible to security breaches, as research shows that text can be
reconstructed from embeddings, even without knowledge of the underlying model.
While defense mechanisms have been explored, they have focused exclusively on
English, leaving other languages potentially exposed to attacks. This work
explores LLM security through multilingual embedding inversion. We define the
problem of black-box multilingual and cross-lingual inversion attacks, and
explore their potential implications. Our findings suggest that multilingual
LLMs may be more vulnerable to inversion attacks, in part because English-based
defenses may be ineffective. To alleviate this, we propose a simple masking
defense effective for both monolingual and multilingual models. This study is
the first to investigate multilingual inversion attacks, shedding light on the
differences in attacks and defenses across monolingual and multilingual
settings.
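The masking defense named above can be pictured with a minimal, hypothetical sketch (not the paper's exact method; the function name, masking rate, and zeroing strategy are assumptions for illustration): before an EaaS provider returns an embedding, a fraction of its dimensions is masked out, degrading the signal available to an inversion model while aiming to preserve downstream utility.

```python
import numpy as np

def mask_embedding(emb, mask_fraction=0.3, seed=0):
    """Zero out a random fraction of embedding dimensions before serving.

    Hypothetical illustration of a masking-style defense: an attacker
    training an inversion model sees only the masked vector, while
    similarity-based downstream tasks may still work on the rest.
    """
    rng = np.random.default_rng(seed)
    emb = np.asarray(emb, dtype=float).copy()
    n_masked = int(emb.size * mask_fraction)
    idx = rng.choice(emb.size, size=n_masked, replace=False)
    emb[idx] = 0.0
    return emb

# Toy usage: mask 30% of a 10-dimensional embedding of ones.
vec = np.ones(10)
masked = mask_embedding(vec, mask_fraction=0.3)
print(int((masked == 0.0).sum()))  # → 3
```

The trade-off such a defense must balance, as the abstract notes, is attack resistance versus utility; the masking rate would be a tunable parameter in practice.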