Repo enlisting Machine Learning datasets from Nepalese Researchers.
- Devanagiri Numbers(०-९) Spoken Audio
- Nepali ASR training data set: Nepali ASR training data set containing ~157K utterances
- Nepali Text to Speech: Dataset 1
- Nepali Text to Speech: Dataset 2
- Health Diseases in Nepali
- EPA Air Pollution Data
- Nepal Government Air Pollution Data
- Dristhi Air Pollution Data
- Voting Ballot Paper Dataset
- Cash Dataset: Image of Nepalese currency
- Images of 10, 50 & 100 rupee notes
- DHCD dataset: A dataset of Devnagari (Nepali) handwritten characters
- License Plate Recognition (LPR) dataset: Nepali Motorbike License plate dataset
- Nepali Characters Dataset
- Nepali Fonts OCR Dataset
- Nepali Handwritten Digits
- Vehicles Dataset: 4800 images of two-wheeler and four-wheeler vehicles from Nepal
- 16NepaliNews Corpus: 14,364 Nepali language news documents
- 65K Nepali Sentences
- Nepal Earthquake Tweets
- Nepali Chat Corpus
- Nagarik News Corpus
- Setopati News Corpus
- Laxmi Prasad Devkota Poems: Collection of poems of Laxmi Prasad Devkota and contains 119161 characters.
- Nepali Names
- Nepali News Classification Dataset
- Nepali Ngram
- Nepali Stopwords
- Nepali Wikipedia Articles Dataset
- Nepali Word List
- Nepali transliteration
- Nepali Textbooks: Collection of school textbooks from Nepal assembled by Professor of Anthropology Kathryn March over the last 30 years.