Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

分词错误(bug) #82

Open
GoogleCodeExporter opened this issue Mar 14, 2016 · 7 comments
Open

分词错误(bug) #82

GoogleCodeExporter opened this issue Mar 14, 2016 · 7 comments

Comments

@GoogleCodeExporter
Copy link

用例设计:
输入:
#1 增加了自己的词库,加入下面4个词语:
北太平庄
北太平庄店
独一味
李军刚

#2 输入分词:
东西塔四惠东水立方李军刚

输出:
东西
东西塔
四惠塔
惠东塔
立方塔
水立方
李军刚

错误地方:
四惠塔(这个塔不应该在这里)


Original issue reported on code.google.com by fantaxy0...@gmail.com on 19 Jan 2011 at 12:31

@GoogleCodeExporter
Copy link
Author

这个错误很严重阿:
输入:
仅36元!尽享『金汉森泉城路店』原价48元的南美烤肉自助餐�
��
输出:
wordList:[仅, 36, 元6, 尽享, 金汉, 汉森, 泉城, 路店, 原价, 48, 
元8, 南美, 烤肉, 自助餐]

Original comment by fantaxy0...@gmail.com on 20 Jan 2011 at 6:05

@GoogleCodeExporter
Copy link
Author

第一次发现这么严重的问题。。。。
请介绍你使用的版本(下载下来的,或svn checkout下来的?)等等

Original comment by qieqie.wang on 20 Jan 2011 at 6:09

@GoogleCodeExporter
Copy link
Author

我用的是从svn下载下来的。下载日期是:昨天,也就是2011-0
1-19中午。

下载的命令没有改变过:
svn checkout http://paoding.googlecode.com/svn/trunk/ paoding-read-only

下载下来发现,有两个包:
paoding-analysis 和 paoding-analysis-1
我用的是paoding-analysis,因为paoding-analysis-1中文-->>有乱码.

完毕。

Original comment by fantaxy0...@gmail.com on 20 Jan 2011 at 6:18

@GoogleCodeExporter
Copy link
Author

先不从svn上check,从download页下来的看看

Original comment by qieqie.wang on 20 Jan 2011 at 6:21

@GoogleCodeExporter
Copy link
Author

Download的没有最新的版本;
原因:
现在想跟solr1.4 和lucen3.0.3 
结合,但是paoding的旧版本有问题。。。

怎么办呢?

Original comment by fantaxy0...@gmail.com on 20 Jan 2011 at 6:27

@GoogleCodeExporter
Copy link
Author

我和楼上碰到的问题一样 郁闷了

Original comment by tree135...@gmail.com on 20 Feb 2011 at 10:33

@GoogleCodeExporter
Copy link
Author

我本周内搞定

Original comment by qieqie.wang on 21 Feb 2011 at 1:34

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant