Skip to content

Transforms Spacy English parser output to a Universal Dependency (UD) document

Notifications You must be signed in to change notification settings

anatoleg/spacy_ud

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spacy_ud

Transforms Spacy English parser output to a Universal Dependency (UD) document

Spacy English parser produces a Spacy document

This code transforms SpacyDocs into UdDocs

Usage:

If you already have a Spacy document (output of Spacy), use this to create a UdDoc:

ud_doc = spacy_to_ud_doc(spacy_doc)

to print a UdDoc, use:

ud_doc.print_doc()

To create a Spacy doc, first create an instance of SpacyParser

spr = SpacyParser()

To get a Spacy doc, use

spacy_doc = spr.nlp('Spacy parser is pretty good')

To print a Spacy doc, use:

print_spacy_doc(spacy_doc)

For more information, see comments inside main.py

ud_sentences.txt contains 197 sentences taken mostly from examples in https://universaldependencies.org/u/dep/all.html

They should cover most of the English dependencies

ud_sentences_parses_curated.txt contains Stanza parses which were manually curated to conform to UD. Stanza is not perfect.

spacy_ud was developed using these sentences.

About

Transforms Spacy English parser output to a Universal Dependency (UD) document

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages