@kuhumcst

Center for Sprogteknologi, Copenhagen University

affixtrain

Using supervised learning, create a set of affix rules for use by the CSTlemma lemmatiser.

cstlemma

Lemmatiser that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a list of full form/lemma pairs.

Java 3 1

Anvil-Facetracker

OpenCV-based plugin for the Anvil annotation software that tracks faces and creates annotations when velocity or acceleration thresholds are exceeded.

rtfreader

Reads an RTF or flat text file and outputs the text, one sentence per line and optionally tokenized.

taggerXML

Modernized version of Eric Brill's Part Of Speech tagger.

makeUTF8

Converts UTF-16 (BE/LE), UTF-32 (BE/LE) and ISO-8859-N to UTF-8. Removes the BOM and surrogate pairs from UTF-8: a code point between U+D800 and U+DBFF followed by a code point between U+DC00 and U+DFFF is converted to one valid code point above U+FFFF.
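The surrogate-pair repair described above can be illustrated with a minimal Python sketch. It is not taken from makeUTF8; the function names `combine_surrogates` and `repair_utf8` are hypothetical, and the sketch assumes the BOM only needs removing at the start of the input and that every surrogate in the input is part of a well-formed high/low pair.

```python
BOM = b"\xef\xbb\xbf"  # UTF-8 encoding of U+FEFF


def combine_surrogates(high: int, low: int) -> int:
    """Combine a high and a low surrogate into one code point above U+FFFF."""
    if not 0xD800 <= high <= 0xDBFF:
        raise ValueError("high surrogate must be in U+D800..U+DBFF")
    if not 0xDC00 <= low <= 0xDFFF:
        raise ValueError("low surrogate must be in U+DC00..U+DFFF")
    # Each surrogate carries 10 payload bits; the result is offset by 0x10000.
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def repair_utf8(data: bytes) -> bytes:
    """Strip a leading BOM and merge UTF-8-encoded surrogate pairs.

    Hypothetical helper: a lone (unpaired) surrogate makes the UTF-16
    decoding step raise UnicodeDecodeError.
    """
    if data.startswith(BOM):
        data = data[len(BOM):]
    # 'surrogatepass' lets the decoder accept the 3-byte surrogate sequences.
    text = data.decode("utf-8", errors="surrogatepass")
    # A round trip through UTF-16 merges each high/low pair into one code point.
    text = text.encode("utf-16-le", errors="surrogatepass").decode("utf-16-le")
    return text.encode("utf-8")


if __name__ == "__main__":
    print(hex(combine_surrogates(0xD83D, 0xDE00)))  # 0x1f600
    print(repair_utf8(BOM + b"\xed\xa0\xbd\xed\xb8\x80"))
```

Here the six bytes `ED A0 BD ED B8 80` encode the pair U+D83D, U+DE00 separately; after repair they become the four-byte sequence `F0 9F 98 80` for U+1F600.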

letterfunc

Functions for upper/lower casing, for testing whether a character is a letter, and for conversion between the Unicode encodings UTF-8 and UTF-16.

hashmap

Simple implementation of a hash map using separate chaining. The table allocates more buckets if the load factor exceeds 100% and frees buckets if the load factor falls below 20%.
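The resizing policy described above can be sketched as follows. This is an illustrative Python sketch, not the repository's code: the class name `ChainedHashMap`, the minimum of 8 buckets, and the choice to double or halve the bucket count are all assumptions; only the 100% and 20% load-factor thresholds come from the description.

```python
class ChainedHashMap:
    """Hash map with separate chaining: each bucket is a list of (key, value)."""

    MIN_BUCKETS = 8  # assumed lower bound, not stated in the description

    def __init__(self):
        self._buckets = [[] for _ in range(self.MIN_BUCKETS)]
        self._size = 0

    def bucket_count(self):
        return len(self._buckets)

    def _chain(self, key):
        return self._buckets[hash(key) % len(self._buckets)]

    def _resize(self, new_count):
        # Rehash every entry into a fresh table of new_count buckets.
        old = self._buckets
        self._buckets = [[] for _ in range(new_count)]
        for chain in old:
            for key, value in chain:
                self._buckets[hash(key) % new_count].append((key, value))

    def put(self, key, value):
        chain = self._chain(key)
        for i, (k, _) in enumerate(chain):
            if k == key:
                chain[i] = (key, value)  # overwrite existing entry
                return
        chain.append((key, value))
        self._size += 1
        # Grow when the load factor (entries / buckets) exceeds 100%.
        if self._size > len(self._buckets):
            self._resize(2 * len(self._buckets))

    def get(self, key, default=None):
        for k, v in self._chain(key):
            if k == key:
                return v
        return default

    def remove(self, key):
        chain = self._chain(key)
        for i, (k, v) in enumerate(chain):
            if k == key:
                del chain[i]
                self._size -= 1
                n = len(self._buckets)
                # Shrink when the load factor falls below 20%.
                if n > self.MIN_BUCKETS and self._size < 0.2 * n:
                    self._resize(max(self.MIN_BUCKETS, n // 2))
                return v
        raise KeyError(key)
```

Keeping the shrink threshold well below the grow threshold avoids repeated resizing when the number of entries hovers around a single boundary.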

Java 0 0

DK-ClarinTools

Servlet that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from the list of candidates.

parsesgml

Parses SGML, HTML and XML in a forgiving way.
