Skip to content

a preprocessed corpus of lojban sentences for machine translation exercises

Notifications You must be signed in to change notification settings

La-Lojban/phrases

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

phrases

a preprocessed corpus of lojban sentences for machine translation exercises

Contains sentences from jboTatoeba project. If a sentence has been translated by gleki,ilmen,uakci,jelca they are shown instead of the original translation at Tatoeba.

Lojban sentences are additionally preprocessed: diacritic orthography removed, cmavo clusters split, dots removed, sentences using {zoi} removed.

Sentences marked as B removed.

About

a preprocessed corpus of lojban sentences for machine translation exercises

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published