Skip to content

We provide the annotations for the HPV vaccine Twitter corpus that was used in "Use of deep learning to analyze Social Media discussions about the Human Papillomavirus Vaccine", as well as a Python script to download the tweets.

Tao-AI-group/HPV_Twitter_Corpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

HPV_Twitter_Corpus

We provide the annotations for the HPV vaccine Twitter corpus that was used in "Use of deep learning to analyze Social Media discussions about the Human Papillomavirus Vaccine", as well as a Python script to download the tweets.

NOTE

Please note that by downloading the Twitter data you agree to follow the Twitter terms of service (https://twitter.com/tos?lang=en), which requires not to redistribute the data and to delete tweets that are marked deleted in the future. You MUST NOT re-distribute the tweets, as this violates the Twitter Terms of Use.

DATA

The annotations for Tweets corpus are saved in "hpv_twitter_annotation.tsv".

The file contain tab separated information about Tweet IDs and their annotations for each theoretical constructs of behavior change theories.

DOWNLOAD

Before you download the Twitter data, please first register as a Twitter developer and get Twitter API keys and tokens: https://developer.twitter.com/en/docs/basics/authentication/overview

In "downloadTweet.py", please specify the keys and tokens from Twitter API and change file path for both input and output files.

Note that some tweets are not public available currently. In the output file, we leave the text blank if it is not available.

CONTACT

Jingcheng Du, jingcheng.du@uth.tmc.edu Cui Tao, cui.tao@uth.tmc.edu

RESEARCH GROUP INFORMATION

Center for Biomedical Semantics and Data Intelligence School of Biomedical Informatics The University of Texas Health Science Center at Houston https://sbmi.uth.edu/bsdi/

CITE

Please cite the following article, if the data is useful for your project.

@article{du2020use,
  title={Use of Deep Learning to Analyze Social Media Discussions About the Human Papillomavirus Vaccine},
  author={Du, Jingcheng and Luo, Chongliang and Shegog, Ross and Bian, Jiang and Cunningham, Rachel M and Boom, Julie A and Poland, Gregory A and Chen, Yong and Tao, Cui},
  journal={JAMA Network Open},
  volume={3},
  number={11},
  pages={e2022025--e2022025},
  year={2020},
  publisher={American Medical Association}
}

UPDATE DATE: 11/17/2020

About

We provide the annotations for the HPV vaccine Twitter corpus that was used in "Use of deep learning to analyze Social Media discussions about the Human Papillomavirus Vaccine", as well as a Python script to download the tweets.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages