Corpora used to create pickled objects, in mohitranka/TwitterSentiment

TwitterSentimentCorpora
=======================

These are the corpora used to create the pickled objects used by http://github.com/mohitranka/TwitterSentiment. The archive contains the following corpora:

1. stopwords - (Downloadable via nltk.download()). Contains stopwords for several languages.

2. movie_reviews - (Downloadable via nltk.download()). Contains 1000 positive and 1000 negative movie reviews (under the "pos" and "neg" directories, respectively).

3. tweets_train - (A modified version of selected data from http://www.stanford.edu/~alecmgo/cs224n/twitterdata.2009.05.25.c.zip). Contains 6215 positive tweets and 6909 negative tweets (under the "pos" and "neg" directories, respectively).

4. tweets_test - Contains 211 positive tweets and 38 negative tweets about the movie "Inception" (under the "pos" and "neg" directories, respectively). Gathered using the Twitter Search API and fflick.com, and classified manually.
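
The "pos"/"neg" directory layout described above can be read with a few lines of Python. The following is only a minimal sketch, not the code used by TwitterSentiment: it assumes the archive has been extracted to a local directory, that each file under "pos" or "neg" holds one document, and that the files are UTF-8 text. The function names and the `corpora/tweets_test` path are hypothetical.

```python
from pathlib import Path


def load_labeled_corpus(corpus_dir):
    """Return (text, label) pairs from a corpus with "pos" and "neg" subdirectories.

    Assumption: every regular file in a label directory is one document.
    """
    samples = []
    for label in ("pos", "neg"):
        label_dir = Path(corpus_dir) / label
        if not label_dir.is_dir():
            # Skip a missing label directory instead of failing.
            continue
        for path in sorted(label_dir.iterdir()):
            if path.is_file():
                # Replace undecodable bytes rather than raising on odd encodings.
                text = path.read_text(encoding="utf-8", errors="replace")
                samples.append((text, label))
    return samples


def count_by_label(samples):
    """Count how many documents carry each label."""
    counts = {}
    for _text, label in samples:
        counts[label] = counts.get(label, 0) + 1
    return counts


if __name__ == "__main__":
    # Hypothetical location of the extracted archive.
    samples = load_labeled_corpus("corpora/tweets_test")
    print(count_by_label(samples))
```

Comparing the counts printed for each corpus against the figures listed above is a quick way to check that the archive was extracted completely.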