Analysis

Adding Structure to AI Harm

An Introduction to CSET's AI Harm Framework

Mia Hoffmann

Heather Frase

July 2023

Real-world harms caused by the use of AI technologies are widespread. Tracking and analyzing them improves our understanding of the variety of harms and the circumstances that lead to their occurrence once AI systems are deployed. This report presents a standardized conceptual framework for defining, tracking, classifying, and understanding harms caused by AI. It lays out the key elements required for the identification of AI harm, their basic relational structure, and definitions without imposing a single interpretation of AI harm. The report concludes with an example of how to apply and customize the framework while keeping its modular structure.


Executive Summary

Harms from the use of artificial intelligence systems (“AI harms”) are varied and widespread. Monitoring and examining these harms (AI harm analysis) is a critical step towards mitigating risks from AI. Such analyses directly inform AI risk mitigation efforts by improving our understanding of how AI systems cause harm, enabling earlier detection of emerging types of harm, and directing resources to where prevention is needed most.

This paper introduces the CSET AI Harm Framework, a standardized conceptual framework to support and facilitate analyses of AI harm. This framework improves the comparability of harm monitoring efforts by providing a common foundation that consistently identifies AI harms, while providing modularity to adapt to different analytical needs.

Key Components of CSET’s AI Harm Framework

The CSET AI Harm Framework lays out the key elements required for the identification of AI harm, their basic relational structure, and definitions without imposing a single interpretation of AI harm. Specifically, this framework:

Defines “AI harm” as when an entity experiences harm (or potential for harm) that is directly linked to the behavior of an AI system.
Groups harm into either tangible or intangible harm. Tangible harm is harm that is observable, verifiable, and definitive. Intangible harm is harm that cannot be directly observed or does not have any material or physical effect. Because of its observability, tangible harm is inherently easier to detect and identify. This means that tangible harm data is more consistent, less noisy, and easier to analyze.
Allows users to define additional categories of tangible and intangible harm. The CSET AI Harm Framework provides some common categories of harm, such as harm to physical health or safety, financial loss, property damage, detrimental content, bias and differential treatment, and violation of privacy, human and civil rights, or democratic norms. The framework also allows for the inclusion of new categories, since new harm types could emerge in the future or be more relevant to another incident data source.
Distinguishes harm that actually occurred from harm that may occur. Differentiating between harm that occurred and harm that may occur allows for the tracking of realized harms, while also enabling research and analysis on potential harms, that is, risks and vulnerabilities.
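One way to picture how these four elements fit together is as a small record type. The sketch below is purely illustrative and is not part of the CSET framework or its taxonomy: the names `HarmRecord`, `Tangibility`, `Status`, `linked_to_ai_behavior`, and `realized_tangible` are hypothetical, and the two sample records are invented for demonstration rather than drawn from any incident database.

```python
from dataclasses import dataclass
from enum import Enum


class Tangibility(Enum):
    """Top-level grouping described in the summary (hypothetical encoding)."""
    TANGIBLE = "tangible"
    INTANGIBLE = "intangible"


class Status(Enum):
    """Whether the harm actually occurred or only may occur."""
    OCCURRED = "occurred"
    POTENTIAL = "potential"


@dataclass(frozen=True)
class HarmRecord:
    ai_system: str               # the AI system involved
    harmed_entity: str           # who or what experiences the harm
    category: str                # free text, so users can add new categories
    tangibility: Tangibility
    status: Status
    linked_to_ai_behavior: bool  # is the harm directly linked to the system's behavior?

    def is_ai_harm(self) -> bool:
        # Under the definition above, a record counts as AI harm only if the
        # harm (or potential for harm) is directly linked to the AI system.
        return self.linked_to_ai_behavior


def realized_tangible(records):
    """Return AI harms that are both tangible and have actually occurred."""
    return [
        r for r in records
        if r.is_ai_harm()
        and r.tangibility is Tangibility.TANGIBLE
        and r.status is Status.OCCURRED
    ]


# Invented sample records, for illustration only.
EXAMPLES = [
    HarmRecord("driving assistant", "pedestrian", "physical health or safety",
               Tangibility.TANGIBLE, Status.OCCURRED, True),
    HarmRecord("recommender system", "platform users", "violation of privacy",
               Tangibility.INTANGIBLE, Status.POTENTIAL, True),
]

for record in realized_tangible(EXAMPLES):
    print(record.category)
```

Keeping `category` as free text rather than a fixed enumeration mirrors the framework's allowance for user-defined categories; a customized framework might instead restrict it to a documented list.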

In addition to providing introductions to the definitions and concepts of the CSET AI Harm Framework, this report also:

Discusses how users can adapt the framework. In order to apply the framework to data, users should create a customized framework. This requires specifying the framework’s components to such a degree that it can be used to extract all the information needed to identify and characterize AI harms according to the user’s analytic interests and the limitations of the data source.
Provides an example customized framework. As an example, this report shows how the CSET AI Harm Framework was customized for use in the CSET AI Harm Taxonomy for AIID. Since modifications and definitions are centrally documented in the CSET taxonomy, database users are able to retrace the underlying framework and compare it to other taxonomies built on the CSET AI Harm Framework.
Details future additions to the framework. Future versions of the CSET AI Harm Framework will incorporate content on the severity and spread of AI harm. When combined, these factors can inform the aggregated impact of a particular harm.

More detailed annotation guidelines are available on GitHub in the CSET AI Harm Taxonomy for AIID and Annotation Guide.

Understanding AI Harms: An Overview

August 2023

As policymakers decide how best to regulate AI, they first need to grasp the different types of harm that various AI applications might cause at the individual, national, and even societal levels. To better understand…

Other

AI Incident Collection: An Observational Study of the Great AI Experiment

September 2023

This explainer defines criteria for effective AI Incident Collection and identifies tradeoffs between potential reporting models: mandatory, voluntary, and citizen reporting.

Analysis

AI Accidents: An Emerging Threat

July 2021

As modern machine learning systems become more widely used, the potential costs of malfunctions grow. This policy brief describes how trends we already see today, both in newly deployed artificial intelligence systems and in older technologies, show…

Analysis

One Size Does Not Fit All

February 2023

Artificial intelligence is so diverse in its range that no simple one-size-fits-all assessment approach can be adequately applied to it. AI systems have a wide variety of functionality, capabilities, and outputs. They are also created…

Analysis

A Matrix for Selecting Responsible AI Frameworks

June 2023

Process frameworks provide a blueprint for organizations implementing responsible artificial intelligence (AI), but the sheer number of frameworks, along with their loosely specified audiences, can make it difficult for organizations to select ones that meet…

Center for Security and Emerging Technology
