Ids of Dutch tweets containing the phrase covid or corona used by the project PuReGoME

puregome/ids


ids

Ids of tweets written in Dutch containing the phrase covid or corona, used in the project PuReGoME. The ids are stored one per line in zipped text files, with one file per day and one folder per year. For example, the ids of the tweets of Friday 13 March 2020 can be found in the file 2020/20200313.zip.
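The layout described above can be read with a few lines of Python. This is only a minimal sketch: the helper names `path_for_date` and `read_ids` are hypothetical, and because the name of the text file inside each zip is not documented here, the sketch simply reads every member of the archive. Ids are kept as strings.

```python
import io
import zipfile
from datetime import date


def path_for_date(day: date) -> str:
    """Build the relative path for a day, following the YYYY/YYYYMMDD.zip layout."""
    return f"{day:%Y}/{day:%Y%m%d}.zip"


def read_ids(zip_source) -> list[str]:
    """Collect the ids (one per line) from every member of a zip archive.

    zip_source may be a file path or a binary file-like object.
    Reading all members is an assumption; the member name is not documented.
    """
    ids = []
    with zipfile.ZipFile(zip_source) as archive:
        for member in archive.namelist():
            with archive.open(member) as handle:
                for line in io.TextIOWrapper(handle, encoding="utf-8"):
                    line = line.strip()
                    if line:  # skip empty lines
                        ids.append(line)
    return ids
```

With a local copy of the repository, `read_ids(path_for_date(date(2020, 3, 13)))` would return the ids stored for Friday 13 March 2020.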

The tweets were selected with the regular expression covid|corona. The expression also matches words that contain covid or corona, such as coronatest. The language of a tweet is taken from the Twitter metadata field lang; only tweets with the lang value nl are included.
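The selection rule can be illustrated with a small Python sketch. It is not the project's actual selection code. The function name `is_selected`, the dictionary representation of a tweet, and the `text` key are assumptions. Case-insensitive matching is also an assumption, since the description does not say how letter case was handled.

```python
import re

# The expression from the description; re.IGNORECASE is an assumption.
PATTERN = re.compile(r"covid|corona", re.IGNORECASE)


def is_selected(tweet: dict) -> bool:
    """Return True for a tweet with lang == "nl" whose text matches the pattern.

    The tweet is assumed to be a dict with a "lang" key (the Twitter
    metadata field) and a "text" key (hypothetical name for the tweet text).
    """
    if tweet.get("lang") != "nl":
        return False
    # search() finds the pattern anywhere, so "coronatest" matches as well
    return PATTERN.search(tweet.get("text", "")) is not None
```

Because `search` looks for the pattern anywhere in the text, a tweet containing only the word coronatest is selected, which matches the behaviour described above.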

When using data from this collection, please cite our project paper:

Shihan Wang, Marijn Schraagen, Erik Tjong Kim Sang and Mehdi Dastani, Public Sentiment on Governmental COVID-19 Measures in Dutch Social Media. In: Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020 (NLP-COVID19-EMNLP), 20 November 2020.
