Signed-Prompt: A New Approach to Prevent Prompt Injection Attacks Against LLM-Integrated Applications

TOP 文献データベース Signed-Prompt: A New Approach to Prevent Prompt Injection Attacks Against LLM-Integrated Applications

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2401.07612

PDF

https://arxiv.org/pdf/2401.07612

文献情報

作者: Xuchen Suo
公開日: 2024-1-15
所属機関: Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University
所属の国: Hong Kong, China
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

プロンプトインジェクション LLMセキュリティ

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

The critical challenge of prompt injection attacks in Large Language Models (LLMs) integrated applications, a growing concern in the Artificial Intelligence (AI) field. Such attacks, which manipulate LLMs through natural language inputs, pose a significant threat to the security of these applications. Traditional defense strategies, including output and input filtering, as well as delimiter use, have proven inadequate. This paper introduces the 'Signed-Prompt' method as a novel solution. The study involves signing sensitive instructions within command segments by authorized users, enabling the LLM to discern trusted instruction sources. The paper presents a comprehensive analysis of prompt injection attack patterns, followed by a detailed explanation of the Signed-Prompt concept, including its basic architecture and implementation through both prompt engineering and fine-tuning of LLMs. Experiments demonstrate the effectiveness of the Signed-Prompt method, showing substantial resistance to various types of prompt injection attacks, thus validating its potential as a robust defense strategy in AI security.

外部データセット

Delete Command Dataset