Train NLTK objects with zero code
Python Shell
Latest commit e245bd5 Aug 5, 2015 @japerk Merge pull request #30 from aleksandrpanteleymonov/master
Fix analyze_classifier_coverage when using --ngrams parameter
Permalink
Failed to load latest commit information.
docs cleanup Sep 15, 2014
nltk_trainer tagset argument for train_chunker, remove babelfish references Apr 17, 2015
tests tagset argument for train_chunker, remove babelfish references Apr 17, 2015
.gitignore ignore compiled python files Oct 27, 2013
.hgignore initial docs Jul 31, 2011
LICENSE apache license, initial setup.py, shebang in scripts Feb 27, 2011
README.rst links to text-processing.com & NLTK 3 cookbook Sep 15, 2014
analyze_chunked_corpus.py analyze chunked corpus works for v3 Jan 5, 2014
analyze_chunker_coverage.py analyze chunker coverage working for v3 Jan 5, 2014
analyze_classifier_coverage.py Fix analyze_classifier_coverage when using --ngrams parameter Aug 1, 2015
analyze_tagged_corpus.py analyze tagger coverage v3 Jan 5, 2014
analyze_tagger_coverage.py analyze tagger coverage v3 Jan 5, 2014
categorized_corpus2csv.py Correctly pull in the environment python Jan 28, 2013
classify_corpus.py tagset argument for train_chunker, remove babelfish references Apr 17, 2015
combine_classifiers.py Correctly pull in the environment python Jan 28, 2013
requirements.txt fix numpy requirement Dec 22, 2012
setup.py include all nltk_trainer packages Nov 29, 2014
tag_phrases.py Correctly pull in the environment python Jan 28, 2013
train_chunker.py tagset argument for train_chunker, remove babelfish references Apr 17, 2015
train_classifier.py Cleaner method using os.path.split Jul 13, 2014
train_tagger.py train tagger tagset option, v3 specific test script Dec 29, 2013

README.rst

NLTK Trainer

NLTK Trainer exists to make training and evaluating NLTK objects as easy as possible.

Requirements

You must have Python >=2.6 (but not 3.x) with argparse and NLTK 2.0 installed. NumPy, SciPy, and megam are recommended for training Maxent classifiers. To use the sklearn classifiers, you must also install scikit-learn.

If you want to use any of the corpora that come with NLTK, you should install the NLTK data.

Documentation

Documentation can be found at nltk-trainer.readthedocs.org (you can also find these documents in the docs directory. Many of the scripts are covered in Python 3 Text Processing with NLTK 3 Cookbook, and every script provides a --help option that describes all available parameters.