Bayesian Text classification service based on Redis. Built on top of Tornado and React.js
JavaScript Python CSS HTML
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
klassify
.babelrc
.gitattributes
.gitignore
LICENSE
MANIFEST.in
README.md
dev.html
package.json
requirements.txt
server.js
setup.py
webpack.config.js
webpack.production.js

README.md

Klassify

Redis based text classification service with real-time web interface.

What is Text Classification

Text classification, document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a document to one or more classes or categories.

https://en.wikipedia.org/wiki/Document_classification

There are many use cases of document classifiers in real world:

  • Spam filters
  • Web page classification
  • News and and topic categorization
  • Sentiment Analysis

Demo

klassify

Installing

sudo pip install klassify

If you don't have an nltk corpus, you'll need to run this:

python -c 'import nltk; nltk.download("stopwords")' 

You'll also need redis installed, check if you have it installed by running this command:

redis-server

If you get a command not found, follow these instructions

Installing on Virtualenv

virtualenv foo
source foo/bin/activate
pip install klassify
python -m klassify

Usage

python -m klassify

Command line options:

  --port                           run on the given port (default 8888)
  --prefix                         prefix that will be used in redis keys
                                   (default klassify)
  --redis-db                       redis database (default 0)
  --redis-host                     redis host (default localhost)
  --redis-port                     redis port (default 6379)