Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design Authors: Andreas Happe, Jürgen Cito | Published: 2025-04-14 TestbedPrompt validationProgress Tracking 2025.04.14 2025.05.27 Literature Database