A collection of encoded archival description XML documents for text and content analysis.
-
Updated
Jun 6, 2024 - Shell
A collection of encoded archival description XML documents for text and content analysis.
➰Loop through a TSV file and pass columns of data to an external program. A Bash script.
A repo that demonstrates how to build Blacklab corpus via Docker and Nginx.
Mozilla Firefox places.sqlite tables exported to XML files. A Bash script.
computation with LaTeX math corpus
Collection of open source javascript projects
(WIP) Create tens of binaries from GitHub projects with the same compiler flags
Useful shell commands for NLP people
Arabic Keyphrase Extraction Corpus
Command-line corpus tools
Kyrgyz language processing software, models and datasets.
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
Add a description, image, and links to the corpus topic page so that developers can more easily learn about it.
To associate your repository with the corpus topic, visit your repo's landing page and select "manage topics."