SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models

TOP Literature Database SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2502.02787

PDF

https://arxiv.org/pdf/2502.02787

Paper Information

Author: Amirhossein Dabiriaghdam,Lele Wang
Published: 2-5-2025
Updated: 9-11-2025
Affiliation: Department of ECE, University of British Columbia
Country: Canada
Conference

Labels Estimated by AI

Watermark Design Digital Watermarking for Generative AI Robustness Analysis

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

The widespread adoption of large language models (LLMs) necessitates reliable methods to detect LLM-generated text. We introduce SimMark, a robust sentence-level watermarking algorithm that makes LLMs' outputs traceable without requiring access to model internals, making it compatible with both open and API-based LLMs. By leveraging the similarity of semantic sentence embeddings combined with rejection sampling to embed detectable statistical patterns imperceptible to humans, and employing a soft counting mechanism, SimMark achieves robustness against paraphrasing attacks. Experimental results demonstrate that SimMark sets a new benchmark for robust watermarking of LLM-generated content, surpassing prior sentence-level watermarking techniques in robustness, sampling efficiency, and applicability across diverse domains, all while maintaining the text quality and fluency.