Urduhack: NLP library for (🇵🇰) Urdu language

Feature Support

  • Normalization
    • Arabic and Urdu Unicode Redundancy Problem
    • Character Normalization
    • Combined Characters Normalization
    • Diacritics Removal
    • Spaces Before & After Digits
    • Spaces After Punctuation
    • Joined Words Fix
  • Tokenization
    • Sentence Tokenization
    • Word Tokenization
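
To make the feature list concrete, here is a minimal sketch of what several of these steps can look like using only the Python standard library. It is not Urduhack's API or implementation: the function names `normalize` and `split_sentences`, the two-entry character table, and the chosen diacritic and punctuation ranges are all assumptions made for illustration. See the documentation for the library's real interface.

```python
import re
import unicodedata

# Illustrative mapping only (an assumption, not Urduhack's table):
# Arabic code points that look like Urdu letters but differ in Unicode.
ARABIC_TO_URDU = str.maketrans({
    "\u0643": "\u06A9",  # ARABIC LETTER KAF -> ARABIC LETTER KEHEH
    "\u064A": "\u06CC",  # ARABIC LETTER YEH -> ARABIC LETTER FARSI YEH
})

# Combining marks U+064B..U+0652 plus U+0670 (superscript alef).
DIACRITICS = re.compile("[\u064B-\u0652\u0670]")

# Zero-width boundary between a letter and a digit, in either order.
DIGIT_BOUNDARY = re.compile(r"(?<=[^\W\d_])(?=\d)|(?<=\d)(?=[^\W\d_])")

# Urdu full stop, Arabic comma, Arabic question mark.
PUNCT = "\u06D4\u060C\u061F"
AFTER_PUNCT = re.compile("([" + PUNCT + "])(?=\\S)")

# A sentence is a run of text optionally closed by a full stop or question mark.
SENTENCE = re.compile("[^\u06D4\u061F]+[\u06D4\u061F]?")


def normalize(text):
    """Apply the sketched normalization steps in order."""
    text = unicodedata.normalize("NFC", text)   # combine base letter + mark pairs
    text = text.translate(ARABIC_TO_URDU)       # Arabic look-alikes -> Urdu letters
    text = DIACRITICS.sub("", text)             # drop diacritics
    text = DIGIT_BOUNDARY.sub(" ", text)        # spaces before and after digits
    text = AFTER_PUNCT.sub(r"\1 ", text)        # space after punctuation
    return text


def split_sentences(text):
    """Naive sentence tokenization on Urdu sentence-ending punctuation."""
    chunks = (chunk.strip() for chunk in SENTENCE.findall(text))
    return [chunk for chunk in chunks if chunk]


if __name__ == "__main__":
    print(normalize("\u0633\u0627\u0644" + "2019"))
    print(split_sentences("\u06C1\u06D4 \u06A9\u061F"))
```

A real normalizer needs a much larger character table and context-aware rules (the Joined Words Fix, for example, cannot be done with a regular expression alone), which is the gap the library is meant to fill.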

Roadmap

  • Classification
    • Sentiment Analysis
    • Sentence Classification
    • Document Classification
  • Named Entity Recognition
  • Image to Text
  • Speech to Text

Installation

Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.

To install Urduhack, simply use pip:

$ pip install urduhack

Documentation

Fantastic documentation is available at https://urduhack.readthedocs.io/

How to Contribute

  1. Check for open issues or open a fresh issue to start a discussion around a feature idea or a bug. There is a Contributor Friendly tag for issues that should be ideal for people who are not very familiar with the codebase yet.
  2. Write a test which shows that the bug was fixed or that the feature works as expected.
  3. Send a pull request and bug the maintainer until it gets merged and published. :)

Contributors

Special thanks to everyone who helped bring Urduhack to its current state.

Backers

Thank you to all our backers! 🙏 [Become a backer]

Sponsors

Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]

Copyright and license

Code released under the MIT License.