A collection of encoded archival description XML documents for text and content analysis.
-
Updated
Jun 6, 2024 - Shell
A collection of encoded archival description XML documents for text and content analysis.
Kyrgyz language processing software, models and datasets.
A repo that demonstrates how to build Blacklab corpus via Docker and Nginx.
Mozilla Firefox places.sqlite tables exported to XML files. A Bash script.
➰Loop through a TSV file and pass columns of data to an external program. A Bash script.
computation with LaTeX math corpus
Collection of open source javascript projects
(WIP) Create tens of binaries from GitHub projects with the same compiler flags
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
Useful shell commands for NLP people
Command-line corpus tools
Arabic Keyphrase Extraction Corpus
Add a description, image, and links to the corpus topic page so that developers can more easily learn about it.
To associate your repository with the corpus topic, visit your repo's landing page and select "manage topics."