Skip to content


Subversion checkout URL

You can clone with
Download ZIP
Fast structured perceptron sequential labeler
Python C++ C Shell
Latest commit 524d1fe @andersjo andersjo add CYTHON in env variable

Build Status

Rungsted structured perceptron sequential tagger


The software is installable via PyPI, e.g. do

pip install rungsted


The repository contains a subset of the part-of-speech tagged Brown corpus. To run the structured perceptron labeler on this dataset, execute:

python src/ --train data/brown.train --test data/brown.test.vw

Rungsted's input format is closely modeled on the powerful and flexible format of Vowpal Wabbit, with the exception that Rungsted is perfectly fine with labels that are not integers.


Provided you have a working installation of NLTK, you can recreate the Brown dataset with this command.

python rungsted/datasets/ data/brown.train.vw data/brown.test.vw

There is also a script rungsted/datasets/ to convert from CONLL-formatted input to Rungsted

Building and uploading to PyPI

First, run CYTHON=1 python sdist to generate a source distribution. Then upload the distribution files to PyPI with twine: twine upload dist/*.

To develop locally, use CYTHON=1 python develop.

Something went wrong with that request. Please try again.