Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

TOP 文献データベース Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2606.11672

PDF

https://arxiv.org/pdf/2606.11672

文献情報

作者: Derek Yohn,Luke Flancher,Mirajul Islam,Khaled Slhoub
公開日: 2026-6-10
所属機関: College of Engineering and Science, Florida Institute of Technology
所属の国: United States of America
会議名

AIにより推定されたラベル

データ駆動型脆弱性評価インダイレクトプロンプトインジェクション静的アプリケーションセキュリティテスト

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

This paper explores the value of agentic AI tools for cybersecurity purposes. We evaluate the efficacy of a general-purpose GenAI Large Language Model- (GenAI-) based agent when powered by three different Ollama-hosted general-purpose open source models. We assess each agent's performance using precision, recall, false positive count, and a calculated composite score based upon the interplay of the captured metrics, against the baseline performance of an existing, vetted Static Application Security Testing (SAST) tool, Bandit. Our findings refute the notion that a modern open-source GenAI LLM-based agent is currently suitable for the specialized task of SAST scanning under realistic conditions.

外部データセット

Beaverhabits

Fail2ban

Yum