💫 Runtime performance comparison of spaCy against other NLP libraries

Set up the corpus DB

The speed test reads its documents from a simple SQLite table. More corpus ingestors still need to be written; so far there is only one, which creates the table from the Gigaword corpus.

fab corpus.giga:path_to_gigaword/
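
To illustrate what "a simple SQLite table" of documents could look like, here is a minimal sketch using Python's standard `sqlite3` module. The table name `documents`, its columns, and the helper names `create_corpus` and `iter_documents` are assumptions made for this example; the schema the Gigaword ingestor actually creates is not described here and may differ.

```python
import sqlite3


def create_corpus(conn, texts):
    """Create a hypothetical one-table corpus and load raw texts into it."""
    # Assumed schema: one row per document, holding its raw text.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS documents "
        "(id INTEGER PRIMARY KEY, text TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO documents (text) VALUES (?)",
        ((text,) for text in texts),
    )
    conn.commit()


def iter_documents(conn, n=None):
    """Yield document texts in insertion order, optionally only the first n."""
    query = "SELECT text FROM documents ORDER BY id"
    params = ()
    if n is not None:
        query += " LIMIT ?"
        params = (n,)
    for (text,) in conn.execute(query, params):
        yield text


# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
create_corpus(conn, ["First document.", "Second document.", "Third document."])
first_two = list(iter_documents(conn, n=2))
```

Reading rows lazily with a generator means a benchmark can stop after `n` documents without loading the whole corpus into memory.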

Set up the tools

fab init

This should download and install spaCy and other NLP libraries.

Run a benchmark

fab speed:parse,spacy,n=1000
fab speed:tag,spacy
fab speed:tag,spacy,nltk,n=10000
fab speed:tokenize,spacy,clearnlp
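
The commands above use Fabric's `task:arg1,arg2,key=value` syntax for passing arguments on the command line. As a rough illustration of how such a string breaks down, the sketch below splits it into a task name, positional arguments, and keyword arguments. `parse_fab_task` is a hypothetical helper written for this example, not part of Fabric or of this repository, and it ignores details such as escaped commas. Reading the first positional argument as the operation and the rest as library names is inferred from the examples, not confirmed by the fabfile.

```python
def parse_fab_task(spec):
    """Split a 'task:arg1,arg2,key=value' string into its parts.

    Simplified illustration only: real Fabric parsing also handles
    escaping, which this sketch does not.
    """
    name, _, arg_string = spec.partition(":")
    args, kwargs = [], {}
    # Skip empty pieces so a bare 'task' or a trailing comma is harmless.
    for piece in filter(None, arg_string.split(",")):
        key, sep, value = piece.partition("=")
        if sep:
            kwargs[key] = value  # values stay strings, e.g. "10000"
        else:
            args.append(piece)
    return name, args, kwargs


name, args, kwargs = parse_fab_task("speed:tag,spacy,nltk,n=10000")
# name   -> "speed"
# args   -> ["tag", "spacy", "nltk"]
# kwargs -> {"n": "10000"}
```

Note that command-line arguments arrive as strings, so a task receiving `n` would need to convert it with `int(n)` before using it as a document count.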