The volume and speed of data generation in biomedical literature, social media, and other resources during the COVID-19 pandemic are unprecedented. This mountain of data is growing daily across PubMed, Twitter, Google Scholar, and the World Health Organization’s COVID-19 database,1 to name a few. … Beyond social media, the massive COVID-19 Open Research Dataset (CORD-19) has been assembled by organizations including Microsoft, the Allen Institute for Artificial Intelligence, and Georgetown University’s Center for Security and Emerging Technology.2 This dataset houses over 12,000 full-text articles in “machine-readable form” that can be ingested programmatically into software and analyzed using machine learning techniques such as natural language processing (NLP).
Read the full article at The Lancet.
- This resource was published by the World Health Organization. “Global research on coronavirus disease (COVID-19).” https://www.who.int/emergencies/diseases/novel-coronavirus-2019/global-research-on-novel-coronavirus-2019-ncov
- This press release was issued by the White House Office of Science and Technology Policy. “Call to Action to the Tech Community on New Machine Readable COVID-19 Dataset.” https://www.whitehouse.gov/briefings-statements/call-action-tech-community-new-machine-readable-covid-19-dataset/