MaCoCu
MaCoCu focuses on collecting monolingual and parallel data from the Internet, specially for under-resourced languages and DSI-specific data.
Popular repositories Loading
-
-
-
-
BCMS-variant-classifier
BCMS-variant-classifier PublicA classification tool for discriminating between Bosnian, Croatian, Montenegrin, and Serbian
-
Manual-Checking-Web-Corpora-Guidelines
Manual-Checking-Web-Corpora-Guidelines PublicForked from TajaKuzman/GINCO-Genre-Annotation-Guidelines
The Guidelines for Manual Checking of Web Corpora
JavaScript
Repositories
Showing 10 of 10 repositories
- documentation Public
macocu/documentation’s past year of commit activity - BCMS-variant-classifier Public
A classification tool for discriminating between Bosnian, Croatian, Montenegrin, and Serbian
macocu/BCMS-variant-classifier’s past year of commit activity - HT-vs-MT Public Forked from tobiasvanderwerff/HT-vs-MT
Source code for EAMT 2022 paper "Automatic Discrimination of Human and Neural Machine Translation: A Study with Multiple Pre-Trained Models and Longer Context".
macocu/HT-vs-MT’s past year of commit activity - MaCoCu-crawler Public
macocu/MaCoCu-crawler’s past year of commit activity - Manual-Checking-Web-Corpora-Guidelines Public Forked from TajaKuzman/GINCO-Genre-Annotation-Guidelines
The Guidelines for Manual Checking of Web Corpora
macocu/Manual-Checking-Web-Corpora-Guidelines’s past year of commit activity