Skip to content

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

License

Notifications You must be signed in to change notification settings

Esukhia/bordr

 
 

Repository files navigation

bordr

A pip installable version of RDRPOSTagger with Tibetan-specific changes.

Maintenance

Build the source dist:

rm -rf dist/
python3 setup.py clean sdist

and upload on twine (version >= 1.11.0) with:

twine upload dist/*

Latest change

The SDICT content passed to generate INIT file is changed. The words in SDICT are given U(Unique tag from bilou tagging system) tag as those words are segmented as Unique token by botok. With that changed SDICT content, we will get INIT file based on botok segmentation. Hence rules generated will be able to resolve botok segmentation ambiguity.

About

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 76.3%
  • HTML 23.7%