Hacking, The Lazy Way: LLM Augmented Pentesting

Authors: Dhruva Goyal, Sitaraman Subramanian, Aditya Peela, Nisha P. Shetty | Published: 2024-09-14 | Updated: 2025-05-19

2024.09.142025.05.27

Authors: Dhruva Goyal, Sitaraman Subramanian, Aditya Peela, Nisha P. Shetty
Published: 2024-09-14 | Updated: 2025-05-19

Source: https://arxiv.org/abs/2409.09493

PDF: https://arxiv.org/pdf/2409.09493

Labels Predicted by AI

Penetration Testing Applicability File Analysis Method Prompt Engineering

Please note that these labels were automatically added by AI. Therefore, they may not be entirely accurate.
For more details, please see the About the Literature Database page.

Abstract

In our research, we introduce a new concept called “LLM Augmented Pentesting” demonstrated with a tool named “Pentest Copilot,” that revolutionizes the field of ethical hacking by integrating Large Language Models (LLMs) into penetration testing workflows, leveraging the advanced GPT-4-turbo model. Our approach focuses on overcoming the traditional resistance to automation in penetration testing by employing LLMs to automate specific sub-tasks while ensuring a comprehensive understanding of the overall testing process. Pentest Copilot showcases remarkable proficiency in tasks such as utilizing testing tools, interpreting outputs, and suggesting follow-up actions, efficiently bridging the gap between automated systems and human expertise. By integrating a “chain of thought” mechanism, Pentest Copilot optimizes token usage and enhances decision-making processes, leading to more accurate and context-aware outputs. Additionally, our implementation of Retrieval-Augmented Generation (RAG) minimizes hallucinations and ensures the tool remains aligned with the latest cybersecurity techniques and knowledge. We also highlight a unique infrastructure system that supports in-browser penetration testing, providing a robust platform for cybersecurity professionals. Our findings demonstrate that LLM Augmented Pentesting can not only significantly enhance task completion rates in penetration testing but also effectively addresses real-world challenges, marking a substantial advancement in the cybersecurity domain.