v1.0

@OrianeN released this 06 Feb 15:53 · 2 commits to main since this release

The official v1.0 release of the OcWikiDialects dataset (train/dev/test splits in JSONL and CSV formats) and the trained ocDI models.

The FastText embeddings model is too large to attach as a release asset, so we will publish it separately.