-
-
Notifications
You must be signed in to change notification settings - Fork 278
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
词性和权重的分割 #58
Labels
Comments
谢谢提醒! |
词性标注 具体 有哪些词性? |
thx @yanyiwu 不过jieba部分没有中文注解 如
|
最近有个需求要做智能控制,所以分词上有很多问题会需要请教,能给个联系方式吗,比如微信什么的 { cut: [ '把', '卧室', '所有', '的', '灯', '都', '关', '了' ],
tag:
[ { word: '把', tag: 'p' },
{ word: '卧室', tag: 'n' },
{ word: '所有', tag: 'b' },
{ word: '的', tag: 'uj' },
{ word: '灯', tag: 'n' },
{ word: '都', tag: 'd' },
{ word: '关', tag: 'v' },
{ word: '了', tag: 'ul' } ],
extract: [ { word: '卧室', weight: 8.20023407859 } ] } { cut: [ '把', '卧室', '全部', '的', '灯', '都', '关', '了' ],
tag:
[ { word: '把', tag: 'p' },
{ word: '卧室', tag: 'n' },
{ word: '全部', tag: 'n' },
{ word: '的', tag: 'uj' },
{ word: '灯', tag: 'n' },
{ word: '都', tag: 'd' },
{ word: '关', tag: 'v' },
{ word: '了', tag: 'ul' } ],
extract: [ { word: '卧室', weight: 8.20023407859 } ] } { cut: [ '把', '卧室', '全部', '的', '灯关', '了' ],
tag:
[ { word: '把', tag: 'p' },
{ word: '卧室', tag: 'n' },
{ word: '全部', tag: 'n' },
{ word: '的', tag: 'uj' },
{ word: '灯关', tag: 'x' },
{ word: '了', tag: 'ul' } ],
extract:
[ { word: '灯关', weight: 11.739204307083542 },
{ word: '卧室', weight: 8.20023407859 } ] } 三个句子是差不多的。分别是
|
微信联系方式在README.md 最下面就有啊。 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
结果是数组,那么每一条拿出来之后还要 split,或者正则匹配出对应的数据,这样很麻烦。
为什么不直接放对象?
像这样。
还是说有什么更好的实践?
The text was updated successfully, but these errors were encountered: