Skip to content

ruhillo/UzNLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

UzNLP.uz

UzNLP.uz is an open-source Natural Language Processing (NLP) platform for the Uzbek language.
The project provides datasets, linguistic resources, pretrained models, and research tools designed to advance computational linguistics and AI research for low-resource Turkic languages.

🎯 Project Goals

  • Develop high-quality Uzbek NLP datasets
  • Build and release pretrained transformer-based models
  • Provide linguistic annotation tools (NER, POS, Lemmatization, SRL)
  • Support academic and industrial NLP research
  • Create infrastructure for corpus management and evaluation

🚀 Features

  • Named Entity Recognition (NER)
  • Part-of-Speech Tagging (POS)
  • Lemmatization & Morphological Analysis
  • Sentence Segmentation
  • Sentiment Analysis
  • Text Classification
  • Coreference Resolution
  • Uzbek NLP Datasets
  • API and Web-based tools

🧠 Models

  • Transformer-based models (mBERT, XLM-R)
  • Custom fine-tuned Uzbek models
  • Statistical and neural approaches
  • Hybrid rule-based + deep learning systems

📊 Datasets

  • Annotated NER corpora
  • POS-tagged datasets
  • Coreference datasets
  • Slang detection datasets
  • Academic and domain-specific corpora

🏛 Institution

Developed within academic research initiatives in Uzbekistan to promote AI and computational linguistics research.

🤝 Contributing

We welcome researchers, developers, and linguists to contribute to Uzbek NLP ecosystem.

📄 License

Specify your license here (MIT / Apache 2.0 / GPL / etc.)

About

UzNLP.uz — Open Uzbek Natural Language Processing Platform providing datasets, models, and tools for Uzbek language text processing including NER, POS tagging, lemmatization, sentiment analysis, and corpus management.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors