The official v1.0 version of the OcWikiDialects dataset (JSONL and CSV train/dev/test splits) and trained ocDI models.
As the FastText embeddings model is too large to be added to the assets of the release, we will publish it separately.
The official v1.0 version of the OcWikiDialects dataset (JSONL and CSV train/dev/test splits) and trained ocDI models.
As the FastText embeddings model is too large to be added to the assets of the release, we will publish it separately.