corpus data for an isiXhosa - English parallel corpus


Corpus data for an isiXhosa - English parallel corpus. The text and its (partial) alignment were collected, aligned, and manually translated as part of the Medical Machine Translation (MeMaT) project.

The MeMaT project was a collaboration between the University of Cape Town, South Africa, and the University of Edinburgh, UK. It was funded by the EPSRC, UK (WT5518939), for 7 months in 2017-2018.


UoE: Kenneth Heafield, Alexandra Birch, Proyag Pal.

UCT: Maria Keet, Zola Mahlaza.

The professional translator was Ntombi Kwebulana.

Use and reuse

The data is available under a CC-BY licence unless specified otherwise (e.g., the mobile-xhosa data has the stricter CC NC-SA licence).
