mstnn

“Here lies a toppled god.

His fall was not a small one.

We did but build his pedestal,

A narrow and a tall one.”

Tleilaxu Epigram

usage

First one needs to train a model:

python manage.py train models/solresol data/solresol-train.conllu

Training is configurable: for example, one can specify a development dataset against which to check the UAS (unlabelled attachment score) at the end of each training epoch, specify a file of pre-trained lemma embeddings, or exclude lemmas from the feature set altogether. Invoking python manage.py train --help lists the options.
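For those unfamiliar with the metric: UAS (unlabelled attachment score) is the share of tokens that were assigned the correct head. A minimal sketch of the computation follows; the function name and the list-of-heads representation are assumptions made for illustration and are not taken from the mstnn code.

```python
def uas(gold_heads, predicted_heads):
    """Return the fraction of tokens whose predicted head equals the gold head.

    Both arguments are equally long sequences of head indices, one per token,
    with 0 standing for the artificial root (as in CoNLL-U).
    """
    if len(gold_heads) != len(predicted_heads):
        raise ValueError("both sequences must cover the same tokens")
    if not gold_heads:
        return 0.0

    correct = sum(g == p for g, p in zip(gold_heads, predicted_heads))
    return correct / len(gold_heads)
```

Dependency labels play no part here: only the head of each token is compared.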

Then one can proceed to parsing some fresh data using the trained model:

python manage.py parse models/solresol data/solresol-test.conllu output/solresol.conllu
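Both the input and the output of the parse command are CoNLL-U files: one token per line, ten tab-separated columns, a blank line between sentences. The sketch below shows roughly what reading such a file involves. It is not the reader that mstnn uses, and the function name and the choice of columns to keep are assumptions made for illustration.

```python
def read_conllu(lines):
    """Yield sentences as lists of (id, form, lemma, head) tuples.

    Comment lines, multiword token ranges (e.g. 1-2) and empty nodes
    (e.g. 1.1) are skipped. A head of "_" is returned as None.
    """
    sentence = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            # a blank line closes the current sentence
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue

        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # multiword token or empty node

        head = int(cols[6]) if cols[6].isdigit() else None
        sentence.append((int(cols[0]), cols[1], cols[2], head))

    # the last sentence may lack a trailing blank line
    if sentence:
        yield sentence
```

Since the function takes any iterable of lines, an open file object can be passed directly, e.g. the data/solresol-test.conllu file from the command above.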

The data, models, and output dirs are conveniently git-ignored.

There are a couple of other CLI commands as well; python manage.py --help lists them.

setup

Something like this should do:

git clone && cd
python3 -m venv path/to/envs/mstnn
source path/to/envs/mstnn/bin/activate
pip install -r requirements.txt
python manage.py unittest

The neural network is built entirely with Keras, so the choice of Keras backend should not matter.

idea

If you are here accidentally, but you are still here nonetheless: this is a graph-based dependency parser, a descendant of sorts of MSTParser. It uses a neural network to predict edge probabilities, which are then fed into an implementation of the cool Chu–Liu/Edmonds algorithm in order to produce the most probable parse tree. It only works with CoNLL-U datasets, but making it read other formats would be easy.
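To give a feel for the second half of that pipeline, here is a minimal recursive sketch of the Chu–Liu/Edmonds algorithm. It is not the implementation found in this repo: the function names, the dict-of-edges input and the node numbering (0 for the root) are all assumptions made for illustration. Scores are treated as additive, so edge probabilities would have to be converted to log-probabilities first if the goal is the tree with the highest product of probabilities. The sketch also assumes that every non-root node has at least one incoming edge.

```python
def find_cycle(heads):
    """Return the nodes of a cycle in a {dependent: head} map, or None."""
    for start in heads:
        path, node = [], start
        while node in heads and node not in path:
            path.append(node)
            node = heads[node]
        if node in path:
            return path[path.index(node):]
    return None


def max_arborescence(nodes, score, root=0):
    """Return {dependent: head} for the highest-scoring tree rooted at root.

    nodes is a list of integer node ids (root included);
    score maps (head, dependent) pairs to additive edge scores.
    """
    # step 1: greedily pick the best incoming edge for each non-root node
    heads = {}
    for d in nodes:
        if d != root:
            candidates = [h for h in nodes if h != d and (h, d) in score]
            heads[d] = max(candidates, key=lambda h: score[h, d])

    cycle = find_cycle(heads)
    if cycle is None:
        return heads

    # step 2: contract the cycle into a single fresh node c
    in_cycle = set(cycle)
    c = max(nodes) + 1
    rest = [n for n in nodes if n not in in_cycle]
    new_score, enter, leave = {}, {}, {}
    for (h, d), s in score.items():
        if h in in_cycle and d in in_cycle:
            continue
        if d in in_cycle:
            # entering the cycle at d means giving up the cycle edge into d
            adjusted = s - score[heads[d], d]
            if (h, c) not in new_score or adjusted > new_score[h, c]:
                new_score[h, c] = adjusted
                enter[h] = d
        elif h in in_cycle:
            if (c, d) not in new_score or s > new_score[c, d]:
                new_score[c, d] = s
                leave[d] = h
        else:
            new_score[h, d] = s

    contracted = max_arborescence(rest + [c], new_score, root)

    # step 3: expand c back into the original cycle nodes
    result = {}
    for d, h in contracted.items():
        if d == c:
            result[enter[h]] = h
        elif h == c:
            result[d] = leave[d]
        else:
            result[d] = h
    for d in cycle:
        if d not in result:
            result[d] = heads[d]
    return result
```

For a sentence of n tokens, nodes would be list(range(n + 1)) and score would hold one entry per candidate head-dependent pair; the returned dict then gives the head of each token. Note that this plain version does not force the root to have exactly one child.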

licence

MIT. Do as you please and praise the snake gods.

