common-parallel-corpora

Common Tools and Resources for Machine Translation in More Languages

Corpora

Common Parallel Corpora is a high-quality, community-driven extension of multitext-nllb-seed, flores-200, and ntrex-128 to more languages: nqo_Nkoo, ful_Adlm (coming soon).

Projects

Fria||el

Fria||el is a collaborative parallel text curation system that tracks individual segments through a translation and copyedit workflow. Each segment is first translated by one translator and then copyedited, one pass after another, by other translators. Fria||el lets translators inspect variants of the source segment in multiple languages at the same time, so each segment ends up translated and copyedited in the context of different subsets of source languages. In addition to the final parallel corpus, Fria||el also yields copyedit logs, which could be valuable in various modeling scenarios.
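The workflow described above can be illustrated with a small Python sketch. This is not Fria||el's actual data model or API; the class names (`Segment`, `CopyeditEntry`), their fields, and the language codes in the usage example are all hypothetical, chosen only to show how one translation followed by sequential copyedits could produce both a final text and a copyedit log.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Sequence, Tuple


@dataclass
class CopyeditEntry:
    """One copyedit pass (hypothetical structure, not Fria||el's own)."""
    editor: str
    visible_langs: Tuple[str, ...]  # source languages the editor had open
    before: str
    after: str


@dataclass
class Segment:
    """A segment moving through translation and sequential copyedits."""
    segment_id: str
    sources: Dict[str, str]  # language code -> source variant
    translator: Optional[str] = None
    translation: Optional[str] = None
    translation_langs: Tuple[str, ...] = ()
    log: List[CopyeditEntry] = field(default_factory=list)

    def _check_langs(self, langs: Sequence[str]) -> Tuple[str, ...]:
        # Only source variants that exist for this segment can be shown.
        unknown = [lang for lang in langs if lang not in self.sources]
        if unknown:
            raise ValueError(f"unknown source languages: {unknown}")
        return tuple(langs)

    def translate(self, translator: str, text: str,
                  visible_langs: Sequence[str]) -> None:
        # Each segment is translated exactly once, by one translator.
        if self.translation is not None:
            raise ValueError("segment is already translated")
        self.translation_langs = self._check_langs(visible_langs)
        self.translator = translator
        self.translation = text

    def copyedit(self, editor: str, text: str,
                 visible_langs: Sequence[str]) -> None:
        # Copyedits follow the translation and come from other translators.
        if self.translation is None:
            raise ValueError("segment has not been translated yet")
        if editor == self.translator:
            raise ValueError("copyeditor must differ from the translator")
        langs = self._check_langs(visible_langs)
        self.log.append(CopyeditEntry(editor, langs, self.translation, text))
        self.translation = text


if __name__ == "__main__":
    seg = Segment("s1", {"eng_Latn": "Hello", "fra_Latn": "Bonjour"})
    seg.translate("alice", "first draft", ["eng_Latn"])
    seg.copyedit("bob", "revised draft", ["eng_Latn", "fra_Latn"])
    print(seg.translation)
    print(seg.log)
```

In this sketch the final corpus would be the `translation` field of every segment, while the `log` lists keep each intermediate version together with the subset of source languages its editor was looking at.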

Machine Translation for Manding Languages Written in Nko.

"Machine Translation for Nko: Tools, Corpora and Baseline Results." (paper, code)

Machine Translation for Fulfulde Written in Adlam.

(coming soon)
