The final version of the latest dataset paper, with a detailed analysis of the dataset, can be found here.

This is the first version of the dataset; it will be updated periodically.


The complete dataset cannot be distributed because of Twitter privacy policies and news publisher copyrights. For the same reason, social engagements and user information are not disclosed.

The dataset provided in this repository (located in the dataset folder) consists of the comma-separated (CSV) files listed below. Each file has the following columns:

  1. recovery-news-data.csv
  • news_id - Unique identifier for each news article.
  • url - URL of the article on the website of the publisher.
  • publisher - Publisher of the news article.
  • publish_date - The publishing date of the news article.
  • author - Author or authors of the article, given as a comma-separated list of names.
  • title - Title of the news article.
  • image - The head image of the news article.
  • body_text - The complete body content of the news article.
  • political_bias - Political bias for each news source.
  • country - The country of the news source.
  • reliability - Reliability label of the news article (1 = reliable, 0 = unreliable).
  2. recovery-social-media-data.csv
  • news_id - Unique identifier for each news article.
  • tweet_id - Unique identifier for each tweet.
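
The two files can be linked through the shared news_id column. The following is a minimal sketch, not part of the original repository, that uses only the Python standard library. The sample rows are made-up placeholders that carry a subset of the columns described above, and the helper names (`read_rows`, `reliability_counts`, `tweets_by_news`) are hypothetical.

```python
import csv
import io
from collections import Counter, defaultdict

# Made-up placeholder rows for illustration only; the real files have
# more columns (see the lists above) and many more rows.
NEWS_CSV = """news_id,publisher,title,reliability
0,Example Publisher A,Placeholder title one,1
1,Example Publisher B,Placeholder title two,0
"""
SOCIAL_CSV = """news_id,tweet_id
0,1000000000000000001
0,1000000000000000002
1,1000000000000000003
"""


def read_rows(handle):
    """Read an open CSV file into a list of dicts keyed by column name."""
    return list(csv.DictReader(handle))


def reliability_counts(news_rows):
    """Count articles per reliability label (1 = reliable, 0 = unreliable)."""
    return Counter(
        "reliable" if row["reliability"] == "1" else "unreliable"
        for row in news_rows
    )


def tweets_by_news(social_rows):
    """Group tweet IDs (kept as strings) by the news_id they refer to."""
    grouped = defaultdict(list)
    for row in social_rows:
        grouped[row["news_id"]].append(row["tweet_id"])
    return dict(grouped)


news = read_rows(io.StringIO(NEWS_CSV))
social = read_rows(io.StringIO(SOCIAL_CSV))
counts = reliability_counts(news)
grouped = tweets_by_news(social)
print(counts)
print(grouped)
```

To run this on the real data, replace the `io.StringIO(...)` arguments with files opened via `open(path, newline="", encoding="utf-8")`; the exact paths inside the dataset folder and the UTF-8 encoding are assumptions. Tweet IDs are deliberately kept as strings, since they are 64-bit integers that lose precision if a tool converts them to floating point.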



Twitter data is gathered using a Twitter developer account and API keys. A Twitter developer account can be created at []. Once the account is created, you can create an app. After the app is created successfully, the keys are available in the keys and tokens section of the app.

Twitter data is gathered using premium search APIs.

Hydrator can be used to rehydrate the tweet IDs.

Steps to Hydrate:

  1. Navigate to hydrator and follow the readme, OR download the installer from hydrator executable
  2. Run the installer and open the application
  3. Link your Twitter account in the settings tab
  4. In the dataset section, upload the file containing only the tweet IDs. Hydrator will then download a CSV file with the tweet information
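
The file uploaded in the last step has to contain only tweet IDs. As far as we know, Hydrator expects a plain text file with one tweet ID per line, so the tweet_id column of recovery-social-media-data.csv needs to be extracted first. The following is a minimal sketch under that assumption; the function name `write_tweet_ids` and the sample rows are hypothetical and not part of the repository.

```python
import csv
import io


def write_tweet_ids(social_csv, out):
    """Copy unique tweet_id values from an open CSV file to `out`, one per line.

    Returns the number of IDs written. IDs are handled as strings so that
    the 64-bit values are never rounded.
    """
    seen = set()
    for row in csv.DictReader(social_csv):
        tweet_id = row["tweet_id"].strip()
        if tweet_id and tweet_id not in seen:
            seen.add(tweet_id)
            out.write(tweet_id + "\n")
    return len(seen)


# Made-up placeholder rows; the same tweet may appear under several news_id values.
sample = io.StringIO(
    "news_id,tweet_id\n"
    "0,1000000000000000001\n"
    "0,1000000000000000002\n"
    "1,1000000000000000001\n"
)
ids_file = io.StringIO()
written = write_tweet_ids(sample, ids_file)
print(written)
print(ids_file.getvalue())
```

With real data, pass the opened CSV file as `social_csv` and a text file opened for writing as `out`, then upload the resulting text file in the dataset section of Hydrator.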


If you are using this dataset, please cite the following paper:

  title={ReCOVery: A Multimodal Repository for COVID-19 News Credibility Research},
  author={Zhou, Xinyi and Mulay, Apurva and Ferrara, Emilio and Zafarani, Reza},
  booktitle={Proceedings of the 29th ACM International Conference on Information & Knowledge Management},


Please contact the authors if you have any questions about the paper, the data, or the code.

