Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

TOP 文献データベース Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2312.04724

PDF

https://arxiv.org/pdf/2312.04724

文献情報

作者: Manish Bhatt;Sahana Chennabasappa;Cyrus Nikolaidis;Shengye Wan;Ivan Evtimov;Dominik Gabi;Daniel Song;Faizan Ahmad;Cornelius Aschermann;Lorenzo Fontana;Sasha Frolov;Ravi Prakash Giri;Dhaval Kapil;Yiannis Kozyrakis;David LeBlanc;James Milazzo;Aleksandar Straumann;Gabriel Synnaeve;Varun Vontimitta;Spencer Whitman;Joshua Saxe
公開日: 2023-12-8
所属機関: Meta
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

サイバーセキュリティプロンプトインジェクション LLMセキュリティ

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

This paper presents CyberSecEval, a comprehensive benchmark developed to help bolster the cybersecurity of Large Language Models (LLMs) employed as coding assistants. As what we believe to be the most extensive unified cybersecurity safety benchmark to date, CyberSecEval provides a thorough evaluation of LLMs in two crucial security domains: their propensity to generate insecure code and their level of compliance when asked to assist in cyberattacks. Through a case study involving seven models from the Llama 2, Code Llama, and OpenAI GPT large language model families, CyberSecEval effectively pinpointed key cybersecurity risks. More importantly, it offered practical insights for refining these models. A significant observation from the study was the tendency of more advanced models to suggest insecure code, highlighting the critical need for integrating security considerations in the development of sophisticated LLMs. CyberSecEval, with its automated test case generation and evaluation pipeline covers a broad scope and equips LLM designers and researchers with a tool to broadly measure and enhance the cybersecurity safety properties of LLMs, contributing to the development of more secure AI systems.

外部データセット

SecurityEval