LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations

Tony, Catherine; Mutas, Markus; Ferreyra, Nicolás E. Díaz; Scandariato, Riccardo

Computer Science > Software Engineering

arXiv:2303.09384 (cs)

[Submitted on 16 Mar 2023]

Title:LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations

Authors:Catherine Tony, Markus Mutas, Nicolás E. Díaz Ferreyra, Riccardo Scandariato

View PDF

Abstract:Large Language Models (LLMs) like Codex are powerful tools for performing code completion and code generation tasks as they are trained on billions of lines of code from publicly available sources. Moreover, these models are capable of generating code snippets from Natural Language (NL) descriptions by learning languages and programming practices from public GitHub repositories. Although LLMs promise an effortless NL-driven deployment of software applications, the security of the code they generate has not been extensively investigated nor documented. In this work, we present LLMSecEval, a dataset containing 150 NL prompts that can be leveraged for assessing the security performance of such models. Such prompts are NL descriptions of code snippets prone to various security vulnerabilities listed in MITRE's Top 25 Common Weakness Enumeration (CWE) ranking. Each prompt in our dataset comes with a secure implementation example to facilitate comparative evaluations against code produced by LLMs. As a practical application, we show how LLMSecEval can be used for evaluating the security of snippets automatically generated from NL descriptions.

Comments:	Accepted at MSR '23 Data and Tool Showcase Track
Subjects:	Software Engineering (cs.SE); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2303.09384 [cs.SE]
	(or arXiv:2303.09384v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2303.09384

Submission history

From: Nicolas E. Diaz Ferreyra PhD [view email]
[v1] Thu, 16 Mar 2023 15:13:58 UTC (917 KB)

Computer Science > Software Engineering

Title:LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:LLMSecEval: A Dataset of Natural Language Prompts for Security Evaluations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators