Chinese word segmentation library based on an HMM

Sun Junyi authored

finalseg

A Chinese word segmentation library in Python, based on a hidden Markov model (HMM)

Usage

  • Automatic: easy_install finalseg
  • Manual: place the finalseg directory in the current directory or in the site-packages directory

Code example

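# -*- coding: utf-8 -*-
# Python 2 example: segment a few ambiguous sentences with finalseg.
import finalseg

sentence_list = [
    "姚晨和老凌离婚了",
    "他说的确实在理",
    "长春市长春节讲话",
]

print u"=Default behavior"

# Join the words returned by cut() with "/ " so the boundaries are visible.
for sentence in sentence_list:
    seg_list = finalseg.cut(sentence)
    print "/ ".join(seg_list)

print u"\n=With new-word discovery enabled\n"

# Same sentences, this time with find_new_word=True.
for sentence in sentence_list:
    seg_list = finalseg.cut(sentence, find_new_word=True)
    print "/ ".join(seg_list)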

Algorithm
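
As a rough illustration of how an HMM can segment Chinese text in general (not necessarily how finalseg itself is implemented), each character can be tagged B (word begin), M (middle), E (end) or S (single-character word), with the best tag sequence found by Viterbi decoding. Everything in this sketch is an assumption made for illustration: the name `toy_cut`, the B/M/E/S tag set, and all probabilities, which are invented and not taken from any trained model.

```python
# -*- coding: utf-8 -*-
import math

STATES = "BMES"
NEG_INF = float("-inf")

# Invented toy parameters (assumptions, not finalseg's real model).
# A sentence can only start with B or S.
START = {"B": math.log(0.6), "S": math.log(0.4)}
# Only transitions that keep the tagging well-formed are listed.
TRANS = {
    "B": {"M": math.log(0.3), "E": math.log(0.7)},
    "M": {"M": math.log(0.3), "E": math.log(0.7)},
    "E": {"B": math.log(0.6), "S": math.log(0.4)},
    "S": {"B": math.log(0.6), "S": math.log(0.4)},
}


def viterbi(obs, emit, floor=1e-6):
    """Return the most probable B/M/E/S tag list for the characters in obs.

    emit maps state -> {character: probability}; unseen pairs get `floor`.
    """
    def log_emit(state, ch):
        return math.log(emit.get(state, {}).get(ch, floor))

    # score[t][s]: best log-probability of any path ending in state s at t.
    score = [dict((s, START.get(s, NEG_INF) + log_emit(s, obs[0]))
                  for s in STATES)]
    back = [{}]
    for t in range(1, len(obs)):
        score.append({})
        back.append({})
        for s in STATES:
            best_prev, best = None, None
            for p in STATES:
                cand = score[t - 1][p] + TRANS[p].get(s, NEG_INF)
                if best is None or cand > best:
                    best_prev, best = p, cand
            score[t][s] = best + log_emit(s, obs[t])
            back[t][s] = best_prev

    # A complete sentence must end on a word boundary: E or S.
    last = max("ES", key=lambda s: score[-1][s])
    tags = [last]
    for t in range(len(obs) - 1, 0, -1):
        last = back[t][last]
        tags.append(last)
    tags.reverse()
    return tags


def toy_cut(sentence, emit):
    """Split sentence into words after every E or S tag."""
    if not sentence:
        return []
    words, start = [], 0
    for i, tag in enumerate(viterbi(sentence, emit)):
        if tag in "ES":
            words.append(sentence[start:i + 1])
            start = i + 1
    return words
```

Calling `toy_cut(sentence, emit)` with a trained emission table would return a list of words. With the invented numbers above the output only demonstrates the mechanics, not realistic segmentation quality.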

Performance

  • Throughput: 200 KB/second
  • Test environment: Intel(R) Core(TM) i7-2600 CPU @ 3.4 GHz; input text: 《围城》.txt (the novel "Fortress Besieged")
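
A figure like this can be reproduced with a small timing helper. The sketch below is only one plausible way to measure it: the source does not say how the number was obtained, so counting UTF-8 encoded bytes and using wall-clock time are both assumptions, and `throughput_kb_per_s` is a hypothetical helper name. The `segment` argument is any callable that takes a text and returns an iterable of words.

```python
import timeit


def throughput_kb_per_s(segment, text, encoding="utf-8"):
    """Return KB of encoded input processed per wall-clock second.

    Assumption: size is measured on the encoded bytes, not on characters.
    """
    size_kb = len(text.encode(encoding)) / 1024.0
    start = timeit.default_timer()
    # Exhaust the result so that lazy (generator-based) segmenters do the work.
    for _ in segment(text):
        pass
    elapsed = timeit.default_timer() - start
    # Guard against a zero reading on very small inputs.
    return size_kb / max(elapsed, 1e-9)
```

With finalseg installed, passing `finalseg.cut` as `segment` and the decoded contents of a large text file as `text` would give a number comparable to the one above, hardware permitting.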

Example
