Hacking, The Lazy Way: LLM Augmented Pentesting

TOP Literature Database Hacking, The Lazy Way: LLM Augmented Pentesting

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2409.09493

PDF

https://arxiv.org/pdf/2409.09493

Paper Information

Author: Dhruva Goyal,Sitaraman Subramanian,Aditya Peela,Nisha P. Shetty
Published: 9-15-2024
Updated: 5-20-2025
Affiliation: Chief Executive Officer, BugBase
Country: United States of America
Conference: Computing Research Repository (CoRR)

Labels Estimated by AI

Penetration Testing Applicability File Analysis Method Prompt Engineering

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

In our research, we introduce a new concept called "LLM Augmented Pentesting" demonstrated with a tool named "Pentest Copilot," that revolutionizes the field of ethical hacking by integrating Large Language Models (LLMs) into penetration testing workflows, leveraging the advanced GPT-4-turbo model. Our approach focuses on overcoming the traditional resistance to automation in penetration testing by employing LLMs to automate specific sub-tasks while ensuring a comprehensive understanding of the overall testing process. Pentest Copilot showcases remarkable proficiency in tasks such as utilizing testing tools, interpreting outputs, and suggesting follow-up actions, efficiently bridging the gap between automated systems and human expertise. By integrating a "chain of thought" mechanism, Pentest Copilot optimizes token usage and enhances decision-making processes, leading to more accurate and context-aware outputs. Additionally, our implementation of Retrieval-Augmented Generation (RAG) minimizes hallucinations and ensures the tool remains aligned with the latest cybersecurity techniques and knowledge. We also highlight a unique infrastructure system that supports in-browser penetration testing, providing a robust platform for cybersecurity professionals. Our findings demonstrate that LLM Augmented Pentesting can not only significantly enhance task completion rates in penetration testing but also effectively addresses real-world challenges, marking a substantial advancement in the cybersecurity domain.

External Datasets

FARR Flow benchmark

bug-bounty reconnaissance

XSS

SQL injection

privilege escalation

boot2root server

Metasploit modules

MSFVenom modules