AutoPatch: Multi-Agent Framework for Patching Real-World CVE Vulnerabilities

TOP 文献データベース AutoPatch: Multi-Agent Framework for Patching Real-World CVE Vulnerabilities

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2505.04195

PDF

https://arxiv.org/pdf/2505.04195

文献情報

作者: Minjae Seo,Wonwoo Choi,Myoungsung You,Seungwon Shin
公開日: 2025-5-7
所属機関: Electronics and Telecommunications Research Institute
所属の国: South Korea
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

RAG 脆弱性分析モデルDoS

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Large Language Models (LLMs) have emerged as promising tools in software development, enabling automated code generation and analysis. However, their knowledge is limited to a fixed cutoff date, making them prone to generating code vulnerable to newly disclosed CVEs. Frequent fine-tuning with new CVE sets is costly, and existing LLM-based approaches focus on oversimplified CWE examples and require providing explicit bug locations to LLMs, limiting their ability to patch complex real-world vulnerabilities. To address these limitations, we propose AutoPatch, a multi-agent framework designed to patch vulnerable LLM-generated code, particularly those introduced after the LLMs' knowledge cutoff. AutoPatch integrates Retrieval-Augmented Generation (RAG) with a structured database of recently disclosed vulnerabilities, comprising 525 code snippets derived from 75 high-severity CVEs across real-world systems such as the Linux kernel and Chrome. AutoPatch combines semantic and taint analysis to identify the most relevant CVE and leverages enhanced Chain-of-Thought (CoT) reasoning to construct enriched prompts for verification and patching. Our unified similarity model, which selects the most relevant vulnerabilities, achieves 90.4 percent accuracy in CVE matching. AutoPatch attains 89.5 percent F1-score for vulnerability verification and 95.0 percent accuracy in patching, while being over 50x more cost-efficient than traditional fine-tuning approaches.

外部データセット

high-severity CVEs

GitHub Advisory Database

Openwall

Chromium issue tracker