Data Brief

Exploring Clusters of Research in Three Areas of AI Safety

Using the CSET Map of Science

Helen Toner and Ashwin Acharya

February 2022

Problems of AI safety are the subject of increasing interest for engineers and policymakers alike. This brief uses the CSET Map of Science to investigate how research into three areas of AI safety — robustness, interpretability and reward learning — is progressing. It identifies eight research clusters that contain a significant amount of research relating to these three areas and describes trends and key papers for each of them.
