WIkipedia
TerminologY
PIcker
WITYPI is a Python project that aims to build a terminology automatically from Wikipedia's database (a form of unsupervised learning).
On Wikipedia, categories are linked to one another, and pages belong to these categories.
By building a graph of the categories and applying TF-IDF to the vocabulary of all pages in each category, we can extract the important vocabulary for each category (class).
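As a rough illustration of the TF-IDF step, here is a minimal sketch using only the standard library. It is not WITYPI's actual code: the `category_pages` toy data, the `tokenize` helper and the `top_terms` function are hypothetical names made up for this example, and the category graph is left out. Each category is treated as one document made of all its pages.

```python
import math
from collections import Counter

# Hypothetical toy data (not real Wikipedia content): category -> page texts.
category_pages = {
    "Physics": ["energy and force act on mass", "quantum energy levels"],
    "Biology": ["cell and gene expression", "the cell membrane and protein"],
    "Music": ["melody and rhythm", "the rhythm of a chord"],
}


def tokenize(text):
    # Naive whitespace tokenizer; a real pipeline would clean wiki markup first.
    return text.lower().split()


def top_terms(category_pages, k=3):
    # Term counts per category, pooling all pages of that category.
    counts = {
        cat: Counter(tok for page in pages for tok in tokenize(page))
        for cat, pages in category_pages.items()
    }
    n_categories = len(counts)
    # Document frequency: in how many categories does each term appear?
    df = Counter(term for c in counts.values() for term in c)

    result = {}
    for cat, c in counts.items():
        total = sum(c.values())
        # TF-IDF = relative frequency in the category * log(N / df).
        scores = {
            term: (freq / total) * math.log(n_categories / df[term])
            for term, freq in c.items()
        }
        # Highest score first; ties are broken alphabetically.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        result[cat] = ranked[:k]
    return result


if __name__ == "__main__":
    for cat, terms in top_terms(category_pages).items():
        print(cat, terms)
```

Words that appear in every category (here "and") get a score of zero, so they never outrank vocabulary that is specific to one category.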
First, create a virtual environment:
virtualenv -p python3 WITYPI
source WITYPI/bin/activate
Then, with the virtualenv activated, install the dependencies with pip3:
pip3 install -r requierement.txt
Then, simply launch the program:
python3 __main__.py