Tweet Corpora

The original tweets may be obtained using the code snippet like download_tweets.py in http://diego.asu.edu/downloads/publications/ADRMine/download_tweets.zip.

Description: Personal experience tweets related to dietary supplements
Dataset (8,770 tweets): https://github.com/medeffects/tweet_corpora/blob/master/SupplementTweetCorpus8770-20160704.csv
Reference: Jiang, K., Calix, R.A., & Gupta, M. (2016). Construction of a Personal Experience Tweet Corpus for Health Surveillance. In Proceedings of the 15th Workshop on Biomedical Natural Language Processing (pp. 128-135). http://www.aclweb.org/anthology/W16-2917

Description: Personal experience tweets related to medications
Dataset (12,331 tweets combined): https://github.com/medeffects/tweet_corpora/blob/master/MedicineCorpusTrainingSet8612-20170501.csv and https://github.com/medeffects/tweet_corpora/blob/master/MedicineCorpusTestSet3719-20170501.csv
Reference: Jiang, K., Feng, S., Calix, R.A., & Gupta, M., Bernard, G.R. (2018). Identifying Tweets of Personal Health Experience through Word Embedding and LTSM. In BMC Bioinformatics 19(Suppl 8):210. https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-018-2198-y.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
MedicineCorpusTestSet3719-20170501.csv		MedicineCorpusTestSet3719-20170501.csv
MedicineCorpusTrainingSet8612-20170501.csv		MedicineCorpusTrainingSet8612-20170501.csv
README.md		README.md
SupplementTweetCorpus8770-20160704.csv		SupplementTweetCorpus8770-20160704.csv

Provide feedback