Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", presented at LREC 2016.
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with Labeled Latent Dirichlet Allocation (L-LDA, LLDA, sLDA).
Python code for reading Brat Repositories. Supports saving and reading from XML files for easy acces to annotations.