Skip to content

Commit

Permalink
list formatting
Browse files Browse the repository at this point in the history
  • Loading branch information
japerk committed Feb 4, 2011
1 parent 328a6b8 commit 0709a3a
Showing 1 changed file with 6 additions and 5 deletions.
11 changes: 6 additions & 5 deletions README.rst
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ NLTK Trainer exists to make training and evaluating NLTK objects as easy as poss
Requirements
------------

You must have Python 2.6 with `argparse <http://docs.python.org/library/argparse.html>`_ and `NLTK <http://www.nltk.org/>`_ 2.0 installed. `NumPy <http://numpy.scipy.org/>`_, `SciPy <http://www.scipy.org/>`_, and `megam <http://www.cs.utah.edu/~hal/megam/>`_ are recommended for training Maxent classifiers.
You must have Python 2.6 with `argparse <http://pypi.python.org/pypi/argparse/>`_ and `NLTK <http://www.nltk.org/>`_ 2.0 installed. `NumPy <http://numpy.scipy.org/>`_, `SciPy <http://www.scipy.org/>`_, and `megam <http://www.cs.utah.edu/~hal/megam/>`_ are recommended for training Maxent classifiers.


Training Classifiers
Expand Down Expand Up @@ -90,10 +90,11 @@ Analyzing a Tagged Corpus
-------------------------

The ``analyze_tagged_corpus.py`` script will show the following statistics about a tagged corpus:
* total number of words
* number of unique words
* number of tags
* the number of times each tag occurs

* total number of words
* number of unique words
* number of tags
* the number of times each tag occurs

To analyze the treebank corpus::
python analyze_tagged_corpus.py treebank
Expand Down

0 comments on commit 0709a3a

Please sign in to comment.