Grow your team on GitHub
GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.Sign up
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are support, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processi…
grlc builds Web APIs using shared SPARQL queries
A set of workflows for corpus building through OCR, post-correction and Natural Language Processing
B&G LABS experimental space; React based UI components; testing LABS APIs; etc
Data for Frog, mandatory
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
Proposal for crosswalks between a number of video annotation tools, including the CLARIAH Web Annotation tool, ELAN, FrameTrail, VIAN and Waldorf.js.
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Command-line utilities for working with the Format for Linguistic Annotation (FoLiA), powered by libfolia (C++), written by Ko van der Sloot
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
FoLiA library for C++
Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)
WP4 SPARQL queries using hisco data
SERPENS: SEaRch PEst and Nuisance Species (A CLARIAH Research Pilot Project)
Toad: Trainer Of All Data, the Frog training collection
This repository contains the converter files used to convert .csv's into RDF using COW.
Amsterdam Time Machine
Service for converting CSV to the CSVW RDF format using COW
This repository holds some schemas used by tool and service metadata specifications
Safely convert IRI-like string to IRI.
Current historical studies of career mobility often focus on linkage of personal records such as baptism records. More qualitative sources, such as biographies contain vital information as well, but are labour intensive to process. We propose a combination of Robust Semantic Parsing and Linked Data conversion tools to automatically derive career…
Queries related to github.com/CLARIAH/BdVteaching
teaching materials for a replication study using Linked Data
Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your command line application, its input, output and parameters, and CLAM wraps around your application to form a fully fledged RESTful webservice.
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Mor…
FoLiA Linguistic Annotation Tool - Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation ty…