CSET

AI Safety Evaluations: An Explainer

Jessica Ji, Vikram Venkatram, and Steph Batalis

May 28, 2025

Effectively evaluating AI models is more crucial than ever. But how do AI evaluations actually work? Our explainer lays out the fundamental types of AI safety evaluations, along with their respective strengths and limitations.