v1.0

Latest

Latest

OrianeN released this 06 Feb 15:53

· 2 commits to main since this release

a61727a

The official v1.0 version of the OcWikiDialects dataset (JSONL and CSV train/dev/test splits) and trained ocDI models.

As the FastText embeddings model is too large to be added to the assets of the release, we will publish it separately.

Assets 7