Code Vulnerability Repair with Large Language Model using Context-Aware Prompt Tuning

Authors: Arshiya Khan, Guannan Liu, Xing Gao | Published: 2024-09-27 | Updated: 2025-06-11

2024.09.272025.06.13

Authors: Arshiya Khan, Guannan Liu, Xing Gao
Published: 2024-09-27 | Updated: 2025-06-11

Source: https://arxiv.org/abs/2409.18395

PDF: https://arxiv.org/pdf/2409.18395

Labels Predicted by AI

Large Language Model

Please note that these labels were automatically added by AI. Therefore, they may not be entirely accurate.
For more details, please see the About the Literature Database page.

Abstract

Large Language Models (LLMs) have shown significant challenges in detecting and repairing vulnerable code, particularly when dealing with vulnerabilities involving multiple aspects, such as variables, code flows, and code structures. In this study, we utilize GitHub Copilot as the LLM and focus on buffer overflow vulnerabilities. Our experiments reveal a notable gap in Copilot’s abilities when dealing with buffer overflow vulnerabilities, with a 76 vulnerability detection rate but only a 15 address this issue, we propose context-aware prompt tuning techniques designed to enhance LLM performance in repairing buffer overflow. By injecting a sequence of domain knowledge about the vulnerability, including various security and code contexts, we demonstrate that Copilot’s successful repair rate increases to 63 compared to repairs without domain knowledge.