Skip to content

Latest commit

 

History

History
62 lines (43 loc) · 2.72 KB

index.rst

File metadata and controls

62 lines (43 loc) · 2.72 KB

Welcome to NLTK-Trainer's documentation!

NLTK-Trainer is a set of Python command line scripts for natural language processing. With these scripts, you can do the following things without writing a single line of code:

  1. train NLTK based models
  2. evaluate pickled models against a corpus
  3. analyze a corpus

These scripts are Python 2 & 3 compatible and work with NLTK 2.0.4 and higher.

Download

The scripts can be downloaded from nltk-trainer on github.

Documentation

train_classifier.rst train_tagger.rst train_chunker.rst analyze_tagged_corpus.rst analyze_tagger_coverage.rst

Books

Python 3 Text Processing with NLTK 3 Cookbook

Python 3 Text Processing with NLTK 3 Cookbook contains many examples for training NLTK models with & without NLTK-Trainer.

  • Chapter 4 covers part-of-speech tagging and train_tagger.py <train_tagger>.
  • Chapter 5 shows how to train phrase chunkers and use train_chunker.py <train_chunker>.
  • Chapter 7 demonstrates classifier training and train_classifier.py <train_classifier>.

Articles

Demos and APIs

Nearly all the models that power the text-processing.com NLTK demos and NLP APIs have been trained using NLTK-Trainer.

Indices and tables

  • genindex
  • modindex
  • search