GitHub - coastalcph/rungsted: Fast structured perceptron sequential labeler

Rungsted structured perceptron sequential tagger

Install

Rungsted is available on PyPI. To install it, run:

pip install rungsted

At the moment, Rungsted only works on Python 3.

Demo

The repository contains a subset of the part-of-speech tagged Brown corpus. To run the structured perceptron labeler on this dataset, execute:

rungsted --train data/brown.train.vw --test data/brown.test.vw

Rungsted's input format is closely modeled on the flexible format of Vowpal Wabbit, with one exception: Rungsted also accepts labels that are not integers.
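As a rough illustration of what a Vowpal Wabbit-style line with a string label looks like, the sketch below parses one such line. It assumes the basic `label |namespace feature ...` layout only; the function name `parse_vw_line` and the example line are hypothetical, and the details Rungsted actually accepts (feature values, tags, importance weights) should be checked against the data files in the repository.

```python
def parse_vw_line(line):
    """Split a VW-style line into (label, {namespace: [features]}).

    Assumption: the line looks like 'LABEL |ns feat feat |ns2 feat'.
    This is an illustrative sketch, not Rungsted's own parser.
    """
    head, *sections = line.rstrip("\n").split("|")
    # The label is the first whitespace-separated token before the first bar.
    head_tokens = head.split()
    label = head_tokens[0] if head_tokens else None

    namespaces = {}
    for section in sections:
        if not section or section[:1].isspace():
            # A space right after the bar means the default (unnamed) namespace.
            name, feats = "", section.split()
        else:
            # Otherwise the first token after the bar is the namespace name.
            name, *feats = section.split()
        namespaces.setdefault(name, []).extend(feats)
    return label, namespaces


# The label is a plain string here, not an integer.
label, namespaces = parse_vw_line("NOUN |w dog suffix=og |prev w=the")
print(label, namespaces)
```

The label is kept as a string throughout, which mirrors the point above that labels need not be integers.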

Datasets

Provided you have a working installation of NLTK, you can recreate the Brown dataset with this command:

python rungsted/datasets/cr_brown_pos_data.py data/brown.train.vw data/brown.test.vw

There is also a script, rungsted/datasets/conll_to_vw.py, that converts CoNLL-formatted input to Rungsted's format.
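To give a feel for this kind of conversion, here is a minimal sketch. It is not the repository's script: it assumes whitespace-separated CoNLL-style lines with the token in the first column and the tag in the last, blank lines between sentences, and a single hypothetical namespace `w`. The function name and the replacement strings for special characters are likewise assumptions.

```python
def conll_to_vw_lines(conll_lines, namespace="w"):
    """Yield VW-style lines for CoNLL-style lines (token first, tag last).

    Illustrative sketch only; column layout and namespace are assumptions.
    """
    for raw in conll_lines:
        fields = raw.split()
        if not fields:
            # Blank line: keep it as the sentence separator.
            yield ""
            continue
        token, tag = fields[0], fields[-1]
        # '|' and ':' have special meaning in VW-style input, so replace them.
        safe = token.replace("|", "PIPE").replace(":", "COLON")
        yield f"{tag} |{namespace} {safe}"


sentence = ["The DT", "dog NN", "barks VBZ", ""]
for line in conll_to_vw_lines(sentence):
    print(line)
```

A real converter would normally emit richer features (affixes, neighboring words) than the single word feature used here.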

Building and uploading to PyPI

First, run CYTHON=1 python setup.py sdist to generate a source distribution. Then upload the distribution files to PyPI with twine: twine upload dist/*.

To develop locally, use CYTHON=1 python setup.py develop.
