💫 Runtime performance comparison of spaCy against other NLP libraries

Set up the corpus DB

The speed test reads documents from a simple SQLite table. More corpus ingestors still need to be written; so far there is only one, which creates the table from the Gigaword corpus.

fab corpus.giga:path_to_gigaword/
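
For anyone writing an additional ingestor, the following is a minimal sketch of what "a simple SQLite table" of documents could look like. The table name `docs`, its columns, and the helper names `create_corpus_db` and `iter_docs` are assumptions made for illustration; they are not taken from this repository, so check the existing Gigaword ingestor for the schema the speed test actually expects.

```python
import sqlite3


def create_corpus_db(db_path, texts):
    """Create a one-table corpus DB and insert raw document texts.

    NOTE: the table name and columns are hypothetical, not the repo's schema.
    """
    conn = sqlite3.connect(db_path)
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "CREATE TABLE IF NOT EXISTS docs ("
            "id INTEGER PRIMARY KEY, text TEXT NOT NULL)"
        )
        conn.executemany(
            "INSERT INTO docs (text) VALUES (?)", ((t,) for t in texts)
        )
    conn.close()


def iter_docs(db_path, n=None):
    """Yield document texts in insertion order, optionally only the first n."""
    conn = sqlite3.connect(db_path)
    try:
        query = "SELECT text FROM docs ORDER BY id"
        params = ()
        if n is not None:
            query += " LIMIT ?"
            params = (int(n),)
        for (text,) in conn.execute(query, params):
            yield text
    finally:
        conn.close()
```

A new ingestor would then only need to turn its source corpus into an iterable of strings and pass it to something like `create_corpus_db`.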

Set up the tools

fab init

This should download and install spaCy and other NLP libraries.

Run a benchmark

fab speed:parse,spacy,n=1000
fab speed:tag,spacy
fab speed:tag,spacy,nltk,n=10000
fab speed:tokenize,spacy,clearnlp
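
The commands above name an operation (`parse`, `tag`, `tokenize`), one or more libraries, and an optional document count `n`. As a rough sketch of the kind of measurement such a task might perform, the function below times an arbitrary callable over the first `n` documents. The name `benchmark`, its return value, and the use of whitespace splitting to count words are assumptions for illustration, not the repository's implementation. Fabric 1.x passes task arguments as strings, so `n` is converted with `int()`.

```python
import itertools
import time


def benchmark(process, docs, n=None):
    """Time `process` over up to `n` documents.

    Returns (n_docs, n_words, seconds). Hypothetical helper, not the
    repo's code. Words are counted by whitespace splitting, outside the
    timed region.
    """
    limit = int(n) if n is not None else None  # Fabric hands over strings
    # Materialize first so reading the corpus is not part of the timing.
    batch = list(itertools.islice(docs, limit))
    n_words = sum(len(text.split()) for text in batch)

    start = time.perf_counter()
    for text in batch:
        process(text)
    elapsed = time.perf_counter() - start
    return len(batch), n_words, elapsed


if __name__ == "__main__":
    # Stand-in "tokenizer": plain whitespace splitting.
    sample = ["A short document.", "Another one, slightly longer than that."]
    n_docs, n_words, secs = benchmark(str.split, sample, n="2")
    print("%d docs, %d words, %.6f s" % (n_docs, n_words, secs))
```

To compare libraries, each one would be wrapped in a callable taking a single text, and all of them run over the same batch of documents.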