Skip to content

Sample multilingual data and tools for creating the data - used for NLP multilingual NLP research

License

Notifications You must be signed in to change notification settings

huu4ontocord/sungai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

sungai

Sungai (pronounced soon-nai) means river in Malay and is a sample multilingual dataset. It is meant to be used for NLP multilingual model distillation. "mdd" stands for "multilingual distillation dataset".

About

Sample multilingual data and tools for creating the data - used for NLP multilingual NLP research

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published