This repository has been archived by the owner on Jun 21, 2022. It is now read-only.

twitterscraper

Listens for tweets of a given hashtag, analyzes them, and stores the results in DynamoDB.

Getting Started

To get started, clone this repository (git clone https://github.com/stoksc/twitterscraper).

Then create a new Python 3.6 virtual environment with virtualenv (virtualenv -p python3.6 venv), activate it (source venv/bin/activate), and install the package from the repository root (pip install .).

After this, you'll need to set the environment variables the program reads: AWS_AK, AWS_SK, TWITTER_CK, TWITTER_CS, TWITTER_AT, and TWITTER_ATS. These are your AWS access key and secret key (for an IAM user with DynamoDB full access) and your Twitter API consumer key, consumer secret, access token, and access token secret.
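A quick way to confirm that all six variables are set before starting the program is a small check like the one below. This is only a sketch: the variable names come from this README, but the load_credentials helper is hypothetical and is not part of the package.

```python
import os

# Variable names as listed in this README. The helper below is a
# hypothetical illustration, not a function provided by twitterscraper.
REQUIRED_VARS = (
    "AWS_AK",       # AWS access key
    "AWS_SK",       # AWS secret key
    "TWITTER_CK",   # Twitter consumer key
    "TWITTER_CS",   # Twitter consumer secret
    "TWITTER_AT",   # Twitter access token
    "TWITTER_ATS",  # Twitter access token secret
)


def load_credentials(environ=os.environ):
    """Return the required variables as a dict, or raise if any is unset or empty."""
    missing = [name for name in REQUIRED_VARS if not environ.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: environ[name] for name in REQUIRED_VARS}
```

Calling load_credentials() with no arguments reads from the current process environment, so a missing key is reported by name at startup rather than surfacing later as an authentication error.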

Then just run the program followed by the keywords you want to track:

python twitterscraper/app.py keyword1 keyword2

Some poorly selected metadata from the tweets will be stored in a DynamoDB table every 10 minutes.
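The periodic write can be pictured as a buffer that collects tweet metadata and hands it off in one batch once the interval has passed. The sketch below is an assumption about how such batching might look, not the code in this repository: TweetBuffer, write_batch, and the clock parameter are hypothetical names, and only the 10-minute interval comes from this README.

```python
import time

FLUSH_INTERVAL_SECONDS = 10 * 60  # the 10-minute interval mentioned above


class TweetBuffer:
    """Hypothetical buffer: collects items and passes them to `write_batch`
    once at least `interval` seconds have elapsed since the last flush."""

    def __init__(self, write_batch, interval=FLUSH_INTERVAL_SECONDS,
                 clock=time.monotonic):
        self._write_batch = write_batch  # callable that receives a list of items
        self._interval = interval
        self._clock = clock              # injectable so the timing can be tested
        self._items = []
        self._last_flush = clock()

    def add(self, item):
        """Buffer one item; flush if the interval has elapsed."""
        self._items.append(item)
        if self._clock() - self._last_flush >= self._interval:
            self.flush()

    def flush(self):
        """Write any buffered items and reset the timer."""
        if self._items:
            self._write_batch(self._items)
        self._items = []
        self._last_flush = self._clock()
```

In a real deployment, write_batch could wrap a boto3 DynamoDB table's batch_writer() and call put_item for each buffered entry. Passing the writer in as a callable keeps the timing logic independent of AWS.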

Built With
  • boto3 - Boto is the Amazon Web Services (AWS) SDK for Python, which allows Python developers to write software that makes use of Amazon services like S3 and EC2. Boto provides an easy-to-use, object-oriented API as well as low-level direct service access.
  • tweepy - Python3 wrapper for the Twitter API.
  • nltk - NLTK is a leading platform for building Python programs to work with human language data.