Grow your team on GitHub
GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.Sign up
FoLiA library for C++
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
Ticcutils, a generic utility library shared by our software.
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Test suite for libfolia
Unit tests for Frog
Tools for TICCL
Lama Events is a calendar application listing events in the near future. The events are detected and selected by a fully automatic procedure in the Dutch Twitter stream.
A set of workflows for corpus building through OCR, post-correction, modernization of historic language and Natural Language Processing
Data for Frog, mandatory
MBT: Memory-based tagger generation and tagging MBT is a memory-based tagger-generator and tagger in one.
Command-line utilities for working with the Format for Linguistic Annotation (FoLiA), powered by libfolia (C++), written by Ko van der Sloot (CLST, Radboud University)
Brew formulas for installing NLP software developed by the Language Machines research group
BP-SOM: A hybrid of back-propagation learning in multi-layered perceptrons and self-organizing maps
Toad: Trainer Of All Data, the Frog training collection
Unit tests for Mbt
Unit tests for Timbl
TiMBL implements several memory-based learning algorithms.
Datafiles for the tokenizer ucto.
Memory Based Word Predictor/Language Model http://ilk.uvt.nl/wopr/
A workflow system for Natural Language Processing.
TiMBL implements several memory-based learning algorithms. This is the server part.
CLST webservices software metadata, only for those webservices/webapplications that are not included in LaMachine
Distributed Tilburg Memory Based Learner
Family Memory Based Learning (original in ILK SVN)
small program to test travis issues. Like OSX and Clang OpenMP support