TwitterSentimentCorpora
=======================
This repository holds the corpora used to create the pickled objects for http://github.com/mohitranka/TwitterSentiment. It contains the following corpora:
1. stopwords - (downloadable via nltk.download()) contains stopwords for several languages.
2. movie_reviews - (downloadable via nltk.download()) contains 1000 positive and 1000 negative movie reviews (under "pos" and "neg" directories, respectively).
3. tweets_train - (a modified version of selected data from http://www.stanford.edu/~alecmgo/cs224n/twitterdata.2009.05.25.c.zip) contains 6215 positive tweets and 6909 negative tweets (under "pos" and "neg" directories, respectively).
4. tweets_test - contains 211 positive tweets and 38 negative tweets about the movie "Inception" (under "pos" and "neg" directories, respectively). Gathered using the Twitter Search API and fflick.com, and classified manually.
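
The labeled corpora above share a "pos"/"neg" directory layout, so they can be read with a few lines of standard-library Python. The following is only a minimal sketch: the function names `load_labeled_corpus` and `count_labels` are hypothetical, and it assumes each document (review or tweet) is stored as its own text file inside the "pos" or "neg" directory, which may not match how tweets_train and tweets_test are actually stored.

```python
import os


def load_labeled_corpus(root):
    """Return (text, label) pairs read from root/pos and root/neg.

    Assumption: every regular file in those directories is one document.
    """
    documents = []
    for label in ("pos", "neg"):
        label_dir = os.path.join(root, label)
        if not os.path.isdir(label_dir):
            # Skip a missing label directory instead of failing.
            continue
        for name in sorted(os.listdir(label_dir)):
            path = os.path.join(label_dir, name)
            if not os.path.isfile(path):
                continue
            # errors="replace" keeps going if a file is not valid UTF-8.
            with open(path, encoding="utf-8", errors="replace") as handle:
                documents.append((handle.read(), label))
    return documents


def count_labels(documents):
    """Count how many documents carry each label."""
    counts = {"pos": 0, "neg": 0}
    for _text, label in documents:
        counts[label] += 1
    return counts
```

Calling `count_labels(load_labeled_corpus("tweets_test"))` is a quick way to check a local copy against the counts listed above, provided the one-file-per-document assumption holds.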