SVMs based Vietnamese morphological analyzer. Web demo is old version.
Python
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
models
tests
visvmtagger
.gitattributes
.gitignore
.travis.yml
LICENSE.txt
MANIFEST.in
README.md
setup.py
viet_morph_analyze.py

README.md

Vietnamese morphological analyzer using SVMs.

SVMs based morphological analyzer for word segmentation and part-of-speech tagging.

Old version(Python2 and YamCha) is here.

Usage

$ pip install visvmtagger
$ python
>>> from visvmtagger import Tagger
>>> t = Tagger()
>>> t.tokenize("Tôi là sinh viên .")
[Tôi(B-PP), là(B-VB), sinh(B-NN), viên(I-NN), .(B-SB)]
>>> t.tokenize("Tôi là sinh viên .")[0].surface # pos is also available
'Tôi'

How to make model file

Please see a main() in visvmtagger/train.py .

License

MIT