Abstract
Software vulnerabilities continue to be ubiquitous, even in the era of
AI-powered code assistants, advanced static analysis tools, and the adoption of
extensive testing frameworks. It has become apparent that we must not only
prevent these bugs, but also eliminate them quickly and efficiently. Yet,
human code intervention is slow, costly, and can often lead to further security
vulnerabilities, especially in legacy codebases. The advent of highly capable
Large Language Models (LLMs) has opened up the possibility of patching many
software defects automatically. We propose LLM4CVE, an LLM-based iterative
pipeline that robustly fixes vulnerable functions in real-world code with high
accuracy. We evaluate our pipeline with state-of-the-art LLMs, such as GPT-3.5,
GPT-4o, Llama 3 8B, and Llama 3 70B. We achieve a human-verified quality score
of 8.51/10 and an increase in ground-truth code similarity of 20% with Llama 3
70B. To promote further research in LLM-based vulnerability repair, we publish
our testing apparatus, fine-tuned weights, and experimental data on our
website.