Data Brief

Using Machine Learning to Fill Gaps in Chinese AI Market Data

Supervised Learning Finds AI-Related Activity That Leading Datasets Miss

Zachary Arnold,

Joanne Boisson,

Lorenzo Bongiovanni,

Daniel Chou,

Carrie Peelman,

and Ilya Rahkovsky

February 2021

In this proof-of-concept project, CSET and Amplyfi Ltd. used machine learning models and Chinese-language web data to identify Chinese companies active in artificial intelligence. Most of these companies were not labeled or described as AI-related in two high-quality commercial datasets. The authors' findings show that using structured data alone—even from the best providers—will yield an incomplete picture of the Chinese AI landscape.

Download Full Report