Skip to content

Latest commit

 

History

History
50 lines (36 loc) · 2.14 KB

README.md

File metadata and controls

50 lines (36 loc) · 2.14 KB

HPV_Twitter_Corpus

We provide the annotations for the HPV vaccine Twitter corpus that was used in "Use of deep learning to analyze Social Media discussions about the Human Papillomavirus Vaccine", as well as a Python script to download the tweets.

NOTE

Please note that by downloading the Twitter data you agree to follow the Twitter terms of service (https://twitter.com/tos?lang=en), which requires not to redistribute the data and to delete tweets that are marked deleted in the future. You MUST NOT re-distribute the tweets, as this violates the Twitter Terms of Use.

DATA

The annotations for Tweets corpus are saved in "hpv_twitter_annotation.tsv".

The file contain tab separated information about Tweet IDs and their annotations for each theoretical constructs of behavior change theories.

DOWNLOAD

Before you download the Twitter data, please first register as a Twitter developer and get Twitter API keys and tokens: https://developer.twitter.com/en/docs/basics/authentication/overview

In "downloadTweet.py", please specify the keys and tokens from Twitter API and change file path for both input and output files.

Note that some tweets are not public available currently. In the output file, we leave the text blank if it is not available.

CONTACT

Jingcheng Du, jingcheng.du@uth.tmc.edu Cui Tao, cui.tao@uth.tmc.edu

RESEARCH GROUP INFORMATION

Center for Biomedical Semantics and Data Intelligence School of Biomedical Informatics The University of Texas Health Science Center at Houston https://sbmi.uth.edu/bsdi/

CITE

Please cite the following article, if the data is useful for your project.

@article{du2020use,
  title={Use of Deep Learning to Analyze Social Media Discussions About the Human Papillomavirus Vaccine},
  author={Du, Jingcheng and Luo, Chongliang and Shegog, Ross and Bian, Jiang and Cunningham, Rachel M and Boom, Julie A and Poland, Gregory A and Chen, Yong and Tao, Cui},
  journal={JAMA Network Open},
  volume={3},
  number={11},
  pages={e2022025--e2022025},
  year={2020},
  publisher={American Medical Association}
}

UPDATE DATE: 11/17/2020