Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

溢出的bug #12

Open
GoogleCodeExporter opened this issue Mar 14, 2016 · 3 comments
Open

溢出的bug #12

GoogleCodeExporter opened this issue Mar 14, 2016 · 3 comments

Comments

@GoogleCodeExporter
Copy link

paoding> 二十亿;
1:      2000000000/

        分词器net.paoding.analysis.analyzer.PaodingAnalyzer
        内容长度 3字符, 分 1个词
        分词耗时 16ms
--------------------------------------------------
paoding> 二十一亿;
1:      2100000000/

        分词器net.paoding.analysis.analyzer.PaodingAnalyzer
        内容长度 4字符, 分 1个词
        分词耗时 16ms
--------------------------------------------------
paoding> 二十二亿;
        All are noise characters or words
--------------------------------------------------

Original issue reported on code.google.com by fore...@gmail.com on 24 Apr 2008 at 8:36

@GoogleCodeExporter
Copy link
Author

报告者,你好

首先,是在什么应用环境下溢出的,以及溢出的异常是什么��
�
其次,你的物理内存,虚拟机内存也需要报告;

Original comment by qieqie.wang on 24 Apr 2008 at 8:40

@GoogleCodeExporter
Copy link
Author

你好。抱歉没说清楚。这个bug和物理内存,虚拟机内存没有��
�系。
我观察分词把汉字的数字全部转换为阿拉伯数字。就测试一��
�边界值。
int类型的最大值是2147483647,大概是21亿。
源代码CharSet.toNumber返回的是int值,在CJKKnife.collectNumber里,数
字全部是int型。
这样切大于21亿的数字时,就得不到预期的结果吧。

Original comment by fore...@gmail.com on 24 Apr 2008 at 8:50

@GoogleCodeExporter
Copy link
Author

OK,谢谢


Original comment by qieqie.wang on 24 Apr 2008 at 8:54

  • Changed state: Accepted

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant