@LanguageMachines

Language Machines

NLP Research group at Centre for Language Studies, Radboud University Nijmegen

  • FoLiA library for C++

    C++ 10 4 GPL-3.0 Updated Oct 18, 2018
  • Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation and splits sentences. It also offers several other basic preprocessing steps, such as changing case, all of which you can use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …

    C++ 35 5 GPL-3.0 Updated Oct 18, 2018
  • Ticcutils, a generic utility library shared by our software.

    C++ 6 3 GPL-3.0 Updated Oct 18, 2018
  • Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

    C++ 41 8 GPL-3.0 Updated Oct 18, 2018
  • Test suite for libfolia

    C++ GPL-3.0 Updated Oct 18, 2018
  • Unit tests for Frog

    Lex Updated Oct 18, 2018
  • Tools for TICCL

    C++ 10 1 GPL-3.0 Updated Oct 16, 2018
  • Lama Events is a calendar application listing events in the near future. The events are detected and selected by a fully automatic procedure in the Dutch Twitter stream.

    HTML 10 3 Updated Oct 15, 2018
  • A set of workflows for corpus building through OCR, post-correction, modernization of historic language, and Natural Language Processing

    Groovy 22 2 GPL-3.0 Updated Oct 15, 2018
  • Python 3 GPL-3.0 Updated Oct 4, 2018
  • Mandatory data for Frog

    Lex 1 GPL-3.0 Updated Oct 3, 2018
  • MBT: Memory-based tagger generation and tagging. MBT is a memory-based tagger-generator and tagger in one.

    C++ 5 1 GPL-3.0 Updated Oct 2, 2018
  • Command-line utilities for working with the Format for Linguistic Annotation (FoLiA), powered by libfolia (C++), written by Ko van der Sloot (CLST, Radboud University)

    C++ 1 2 GPL-3.0 Updated Oct 2, 2018
  • Brew formulas for installing NLP software developed by the Language Machines research group

    Ruby 1 1 Updated Sep 4, 2018
  • BP-SOM: A hybrid of back-propagation learning in multi-layered perceptrons and self-organizing maps

    C++ 1 GPL-3.0 Updated Aug 13, 2018
  • Toad: Trainer Of All Data, the Frog training collection

    C++ 1 GPL-3.0 Updated Jul 13, 2018
  • Python 1 1 Updated Jul 3, 2018
  • Unit tests for Mbt

    Lex Updated Jun 28, 2018
  • Unit tests for Timbl

    Elixir Updated Jun 4, 2018
  • TiMBL implements several memory-based learning algorithms.

    C++ 26 6 GPL-3.0 Updated Jun 4, 2018
  • Datafiles for the tokenizer ucto.

    Shell 6 2 GPL-3.0 Updated May 28, 2018
  • Memory Based Word Predictor/Language Model http://ilk.uvt.nl/wopr/

    C++ 2 Updated May 19, 2018
  • A workflow system for Natural Language Processing.

    Python 14 2 GPL-3.0 Updated May 17, 2018
  • C++ 1 GPL-3.0 Updated May 16, 2018
  • TiMBL implements several memory-based learning algorithms. This is the server part.

    C++ 3 GPL-3.0 Updated May 16, 2018
  • CLST webservices software metadata, only for those webservices/webapplications that are not included in LaMachine

    Updated May 16, 2018
  • Distributed Tilburg Memory Based Learner

    C++ 1 2 GPL-3.0 Updated Mar 26, 2018
  • CSS Updated Feb 28, 2018
  • Family Memory Based Learning (original in ILK SVN)

    C GPL-3.0 Updated Feb 22, 2018
  • Small program to test Travis issues, such as OSX and Clang OpenMP support

    M4 Updated Feb 6, 2018