Executive Summary
Recent developments have improved the ability of large language models (LLMs) and other AI systems to generate computer code. While this is promising for the field of software development, these models can also pose direct and indirect cybersecurity risks. In this paper, we identify three broad categories of risk associated with AI code generation models: 1) models generating insecure code, 2) models themselves being vulnerable to attack and manipulation, and 3) downstream cybersecurity impacts such as feedback loops in training future AI systems.
Existing research has shown that, under experimental conditions, AI code generation models frequently output insecure code. However, evaluating the security of AI-generated code is a highly complex process with many interdependent variables. To further explore the risk of insecure AI-written code, we evaluated code generated by five LLMs. Each model was given the same set of prompts, designed to test likely scenarios in which buggy or insecure code might be produced. Our results show that almost half of the code snippets produced by these five models contain bugs that are often impactful and potentially exploitable. These results are limited to the narrow scope of our evaluation, but we hope they contribute to the larger body of research on the impacts of AI code generation models.
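As a minimal illustrative sketch of how such an experiment can be structured (not the harness used in our evaluation), the loop below prompts each model with the same set of tasks and scans every generated snippet with a static analyzer. The `generate_code` wrapper, the example prompts, and the choice of the open-source Bandit scanner are all assumptions made for illustration.

```python
# Illustrative sketch only, not the harness used in our evaluation:
# prompt each model with the same tasks, then scan every generated snippet
# with a static analyzer. `generate_code`, the example prompts, and the
# choice of the open-source Bandit scanner are all hypothetical.
import json
import subprocess
import tempfile

PROMPTS = [
    "Write a Python function that looks up a user in a SQL database by name.",
    "Write a Python function that saves an uploaded file to disk.",
]

def generate_code(model: str, prompt: str) -> str:
    """Hypothetical wrapper around whichever model API is being evaluated."""
    raise NotImplementedError("plug in the model client under test")

def scan_with_bandit(snippet: str) -> list[dict]:
    """Run Bandit on one snippet and return its reported findings."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(snippet)
        path = f.name
    proc = subprocess.run(["bandit", "-f", "json", path],
                          capture_output=True, text=True)
    return json.loads(proc.stdout).get("results", [])

def evaluate(models: list[str]) -> dict[str, int]:
    """Count, per model, how many prompts yielded at least one finding."""
    flagged = {m: 0 for m in models}
    for model in models:
        for prompt in PROMPTS:
            if scan_with_bandit(generate_code(model, prompt)):
                flagged[model] += 1
    return flagged
```

Even under such a setup, a single static analyzer will miss many classes of weaknesses, which is one reason evaluating the security of generated code remains difficult.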
Given both code generation models’ current utility and the likelihood that their capabilities will continue to improve, it is important to manage their policy and cybersecurity implications. Key findings include the following.
- Industry adoption of AI code generation models may pose risks to software supply chain security. However, these risks will not be evenly distributed: larger, better-resourced organizations will be better positioned to manage them than organizations facing cost and workforce constraints.
- Multiple stakeholders have roles to play in mitigating potential security risks related to AI-generated code. The burden of ensuring that AI-generated code is secure should rest not solely on individual users but also on AI developers, organizations producing code at scale, and those positioned to improve security broadly, such as policymaking bodies and industry leaders. Existing guidance, such as secure software development practices and the NIST Cybersecurity Framework, remains essential to ensure that all code, regardless of authorship, is evaluated for security before it enters production. Other cybersecurity guidance, such as secure-by-design principles, can be expanded to cover code generation models and other AI systems that affect software supply chain security.
- Code generation models also need to be evaluated for security, but doing so is currently difficult. Evaluation benchmarks for code generation models often measure the models’ ability to produce functional code but not their ability to generate secure code, which may incentivize deprioritizing security in favor of functionality during model training. There is also too little transparency around models’ training data, and too little understanding of their internal workings, to explore questions such as whether better-performing models produce more insecure code.