A library that builds on nltk and gensim to automatically generate phrases.
Add the package to your python path using pip:
pip install phrase
To create a phrase dictionary and print out the top 25 phrases:
create_phrase_dictionary <corpus_folder> <phrase_dictionary_output_filename>
This is not a light process, it can take a lot of memory and time, so be warned.
To run all the tests, you need to run py.test to pick up the unit tests. Lettuce is currently being used for BDD tests and needs to be run from the tests folder or with tests/ added to the PYTHONPATH (the tests utilize the units.helpers modules)
py.test PYTHONPATH=tests lettuce tests/features cd tests lettuce features/