Skip to content

danielgalbraith/DepTrain

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DepTrain

Training dependency parser with spaCy in Python3.

Task to ensure rapper name 'Stefflon Don' tagged appropriately. Includes training sentences and test paragraph mentioning name. Uses standard spaCy English model, i.e. CNN trained on OntoNotes, with GloVe vectors trained on Common Crawl. Command to run training:

python train.py -m en [-n] [-o]

Optional arguments for number of iterations e.g. [-n 30] and output directory [-o /path/to/dir]. Visualizes dependency parse of test paragraph using displaCy, by default port 5000 (localhost:5000 in browser).

Repo contains source texts parsed by Stanford CoreNLP in .conllu format, but training and test data already included in train.py.

Requires spaCy 2.0+ and Python 3.0+.

About

Training dependency parser with spaCy in Python 3

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages