Web IP at Risk: Prevent Unauthorized Real-Time Retrieval by Large Language Models

TOP 文献データベース Web IP at Risk: Prevent Unauthorized Real-Time Retrieval by Large Language Models

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2505.12655

PDF

https://arxiv.org/pdf/2505.12655

文献情報

作者: Yisheng Zhong,Yizhu Wen,Junfeng Guo,Mehran Kafai,Heng Huang,Hanqing Guo,Zhuangdi Zhu
公開日: 2025-5-19
所属機関: George Mason University
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

LLMセキュリティインダイレクトプロンプトインジェクションプライバシー管理

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Protecting cyber Intellectual Property (IP) such as web content is an increasingly critical concern. The rise of large language models (LLMs) with online retrieval capabilities presents a double-edged sword that enables convenient access to information but often undermines the rights of original content creators. As users increasingly rely on LLM-generated responses, they gradually diminish direct engagement with original information sources, significantly reducing the incentives for IP creators to contribute, and leading to a saturating cyberspace with more AI-generated content. In response, we propose a novel defense framework that empowers web content creators to safeguard their web-based IP from unauthorized LLM real-time extraction by leveraging the semantic understanding capability of LLMs themselves. Our method follows principled motivations and effectively addresses an intractable black-box optimization problem. Real-world experiments demonstrated that our methods improve defense success rates from 2.5% to 88.6% on different LLMs, outperforming traditional defenses such as configuration-based restrictions.