Growing concerns over the theft and misuse of Large Language Models (LLMs)
have heightened the need for effective fingerprinting, which links a deployed
model back to its original version to detect misuse. In this paper, we define five key
properties for a successful fingerprint: Transparency, Efficiency, Persistence,
Robustness, and Unforgeability. We introduce a novel fingerprinting framework
that provides verifiable proof of ownership while maintaining fingerprint
integrity. Our approach makes two main contributions. First, we propose a Chain
and Hash technique that cryptographically binds fingerprint prompts with their
responses, ensuring no adversary can generate colliding fingerprints and
allowing model owners to irrefutably demonstrate their creation. Second, we
address a realistic threat model in which instruction-tuned models' output
distribution can be significantly altered through meta-prompts. By integrating
random padding and varied meta-prompt configurations during training, our
method preserves fingerprint robustness even when the model's output style is
substantially modified. Experimental results demonstrate that our framework
offers strong security for proving ownership and remains resilient against
benign transformations like fine-tuning, as well as adversarial attempts to
erase fingerprints. Finally, we demonstrate its applicability to
fingerprinting LoRA adapters.
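The core cryptographic binding can be illustrated with a minimal sketch. This is not the paper's exact construction; the function name, prompt/answer sets, and selection rule below are hypothetical, assuming only the stated idea that a hash over the full chain of prompts and candidate responses determines each fingerprint response, so neither the owner nor an adversary can retroactively pick colliding prompt-response pairs:

```python
import hashlib

def chain_and_hash(prompts, candidate_answers):
    """Illustrative sketch: bind each fingerprint prompt to one answer by
    hashing the prompt together with the full chain of prompts and the
    candidate-answer pool (names and details are hypothetical)."""
    # The "chain": every prompt and every candidate answer, concatenated.
    chain = "".join(prompts) + "".join(candidate_answers)
    fingerprint = {}
    for prompt in prompts:
        digest = hashlib.sha256((prompt + chain).encode("utf-8")).digest()
        # The hash, not the owner, selects the answer index, so the
        # prompt-response pairs cannot be chosen after the fact.
        idx = int.from_bytes(digest, "big") % len(candidate_answers)
        fingerprint[prompt] = candidate_answers[idx]
    return fingerprint

pairs = chain_and_hash(["alpha?", "beta?"], ["red", "blue", "green"])
```

Changing any prompt or any candidate answer changes the chain, and therefore the digest and the selected responses, which is what makes the committed fingerprint verifiable yet unforgeable in spirit.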