溢出的bug #12

GoogleCodeExporter · 2016-03-14T02:07:23Z

paoding> 二十亿;
1:      2000000000/

        分词器net.paoding.analysis.analyzer.PaodingAnalyzer
        内容长度 3字符， 分 1个词
        分词耗时 16ms
--------------------------------------------------
paoding> 二十一亿;
1:      2100000000/

        分词器net.paoding.analysis.analyzer.PaodingAnalyzer
        内容长度 4字符， 分 1个词
        分词耗时 16ms
--------------------------------------------------
paoding> 二十二亿;
        All are noise characters or words
--------------------------------------------------

Original issue reported on code.google.com by fore...@gmail.com on 24 Apr 2008 at 8:36

The text was updated successfully, but these errors were encountered:

GoogleCodeExporter · 2016-03-14T02:07:23Z

报告者，你好

首先，是在什么应用环境下溢出的，以及溢出的异常是什么��
�
其次，你的物理内存，虚拟机内存也需要报告；

Original comment by qieqie.wang on 24 Apr 2008 at 8:40

GoogleCodeExporter · 2016-03-14T02:07:23Z

你好。抱歉没说清楚。这个bug和物理内存，虚拟机内存没有��
�系。
我观察分词把汉字的数字全部转换为阿拉伯数字。就测试一��
�边界值。
int类型的最大值是2147483647，大概是21亿。
源代码CharSet.toNumber返回的是int值，在CJKKnife.collectNumber里，数
字全部是int型。
这样切大于21亿的数字时，就得不到预期的结果吧。

Original comment by fore...@gmail.com on 24 Apr 2008 at 8:50

GoogleCodeExporter · 2016-03-14T02:07:23Z

OK,谢谢

Original comment by qieqie.wang on 24 Apr 2008 at 8:54

Changed state: Accepted

GoogleCodeExporter added Priority-Medium Type-Defect auto-migrated labels Mar 14, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

溢出的bug #12

溢出的bug #12

GoogleCodeExporter commented Mar 14, 2016

GoogleCodeExporter commented Mar 14, 2016

GoogleCodeExporter commented Mar 14, 2016

GoogleCodeExporter commented Mar 14, 2016

溢出的bug #12

溢出的bug #12

Comments

GoogleCodeExporter commented Mar 14, 2016

GoogleCodeExporter commented Mar 14, 2016

GoogleCodeExporter commented Mar 14, 2016

GoogleCodeExporter commented Mar 14, 2016