CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models

TOP Literature Database CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2505.16785

PDF

https://arxiv.org/pdf/2505.16785

Paper Information

Author: Zhenzhen Ren,GuoBiao Li,Sheng Li,Zhenxing Qian,Xinpeng Zhang
Published: 5-23-2025
Affiliation: Fudan University
Country: China
Conference

Labels Estimated by AI

Fingerprinting Method LLM Security Model Identification

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Despite providing superior performance, open-source large language models (LLMs) are vulnerable to abusive usage. To address this issue, recent works propose LLM fingerprinting methods to identify the specific source LLMs behind suspect applications. However, these methods fail to provide stealthy and robust fingerprint verification. In this paper, we propose a novel LLM fingerprinting scheme, namely CoTSRF, which utilizes the Chain of Thought (CoT) as the fingerprint of an LLM. CoTSRF first collects the responses from the source LLM by querying it with crafted CoT queries. Then, it applies contrastive learning to train a CoT extractor that extracts the CoT feature (i.e., fingerprint) from the responses. Finally, CoTSRF conducts fingerprint verification by comparing the Kullback-Leibler divergence between the CoT features of the source and suspect LLMs against an empirical threshold. Various experiments have been conducted to demonstrate the advantage of our proposed CoTSRF for fingerprinting LLMs, particularly in stealthy and robust fingerprint verification.