Sungai (pronounced soon-nai) means river in Malay and is a sample multilingual dataset. It is meant to be used for NLP multilingual model distillation. "mdd" stands for "multilingual distillation dataset".
-
Notifications
You must be signed in to change notification settings - Fork 0
Sample multilingual data and tools for creating the data - used for NLP multilingual NLP research
License
huu4ontocord/sungai
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
About
Sample multilingual data and tools for creating the data - used for NLP multilingual NLP research
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published