-
-
Notifications
You must be signed in to change notification settings - Fork 422
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wrong entity end. #17
Comments
相应这一段的你自己的标注json内容,能发出来看下吗? |
{ |
我有加入自定义词典 |
"江铃E200"包含了汉字与英文数字字符,我感觉是python2的编码问题。 |
好,我去试试 |
在给python3安装rasa配套包的时候发现了问题,python包的默认安装目录\usr磁盘满了,增加了硬盘空间后,重装python2的rasa,这个问题解决了,可能是因为这个原因导致安装不完全? |
x_X |
晕,订正一下,不是目录满的原因 |
Anyway,问题解决了就好:) |
2018-06-10 22:13:48 WARNING rasa_nlu.extractors.mitie_entity_extractor - Example skipped: Invalid entity {'start': 2, 'end': 4, 'value': '8点', 'entity': 'time'} in example '今天8点到9点45分有哪些闹钟': entities must span whole tokens. Wrong entity end. |
这个问题,你需要检查一下你的分词,你这样标注实体的话,你的分词必须是 |
@jxg972 搞定了,确实是分词的问题,把8点什么的加入用户字典就行了 |
您好,我定义的词典加载进去还是错误,没有进行训练rasa_rlu时候 自己亲自尝试使用使用该词典进行分词是正确的,您有好的建议吗 |
WARNING:rasa_nlu.extractors.mitie_entity_extractor:Example skipped: Invalid entity {u'start': 0, u'end': 6, u'value': u'\u6c5f\u94c3E200', u'entity': u'\u8f66\u7cfb'} in example '江铃E200VS东风风神AX7新能源': entities must span whole tokens. Wrong entity end.
这里报错说实体位置标注错了,但是分词结果却是一样的
for i in jieba.tokenize('江铃E200VS东风风神AX7新能源'):
print(i)
('江铃E200', 0, 6)
('VS', 6, 8)
('东风风神AX7新能源', 8, 18)
不知道是什么原因?
The text was updated successfully, but these errors were encountered: