Skip to content

A library for sentiment analysis in dictionary framework.

License

Notifications You must be signed in to change notification settings

nickderobertis/pysentiment

 
 

Repository files navigation

pysentiment

Overview

This is a library for sentiment analysis in dictionary framework. Two dictionaries are provided in the library, namely, Harvard IV-4 and Loughran and McDonald Financial Sentiment Dictionaries, which are sentiment dictionaries for general and financial sentiment analysis.

See also http://www.wjh.harvard.edu/~inquirer/ and https://www3.nd.edu/~mcdonald/Word_Lists.html .

Introduction

Positive and Negative are word counts for the words in positive and negative sets.

Polarity and Subjectivity are calculated in the same way of Lydia system. See also http://www.cs.sunysb.edu/~skiena/lydia/

Getting Started

Install pysentiment2:

pip install pysentiment2

A simple example:

import pysentiment2

# Do something with pysentiment2

Usage

To use the Harvard IV-4 dictionary, create an instance of the HIV4 class

import pysentiment2 as ps
hiv4 = ps.HIV4()
tokens = hiv4.tokenize(text)  # text can be tokenized by other ways
                                  # however, dict in HIV4 is preprocessed
                                  # by the default tokenizer in the library
score = hiv4.get_score(tokens)

HIV4 is a subclass for pysentiment2.base.BaseDict. BaseDict can be inherited by implmenting init_dict to initialize _posset and _negset for the dictionary to calculate 'positive' or 'negative' scores for terms.

Similarly, to use the Loughran and McDonald dictionary:

import pysentiment2 as ps
lm = ps.LM()
tokens = lm.tokenize(text)
score = lm.get_score(tokens)

Links

See the documentation here.

Author

pysentiment2 created by Nick DeRobertis but based on pysentiment by Zhichao Han. GNU GPL License.

About

A library for sentiment analysis in dictionary framework.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages

  • Python 90.1%
  • Shell 3.6%
  • Makefile 2.7%
  • Batchfile 2.1%
  • CSS 1.5%