Misalignment between LanguageModelFeaturizer and DIETClassifier in Chinese nlu data #10578
Comments
xikaluo commented: In addition, if I remove the entity annotations in nlu.yml and the 'entities' section in domain.yml, the problem no longer appears.
liugong commented: I have the same problem with Rasa 3.0.4. Is there any way to avoid it for now?
xikaluo commented:
In fact, I have not hit this problem again since updating Rasa to 3.0.4. Perhaps explicitly setting "entity_recognition" to "true" would solve it?
liugong commented:
By default, "entity_recognition" is set to True. This is my configuration; could you please give me some guidance?
After a week of reading and debugging the whole code, I was able to find the root cause: in the entity loss inside DIET, the shapes do not match: 4 (one extra for CLS) vs 3. @xikaluo
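The mismatch described above (4 feature rows including CLS vs 3 entity tags) can be illustrated with a minimal, library-free sketch. This is not Rasa's code: `sequence_features_with_cls`, `check_alignment`, and the sample tokens are hypothetical names and data chosen only to show how one extra CLS row breaks a per-token alignment, and how dropping that row restores it.

```python
from typing import List

CLS_TOKEN = "[CLS]"  # hypothetical marker for the extra sentence-level row


def sequence_features_with_cls(tokens: List[str]) -> List[str]:
    """Hypothetical featurizer output: one row per token plus one CLS row."""
    return [CLS_TOKEN] + list(tokens)


def check_alignment(feature_rows: List[str], entity_tags: List[str]) -> None:
    """Raise if the number of feature rows differs from the number of tags."""
    if len(feature_rows) != len(entity_tags):
        raise ValueError(
            f"shape mismatch: {len(feature_rows)} feature rows "
            f"vs {len(entity_tags)} entity tags"
        )


# Assumed example: three tokens from a Chinese tokenizer, one tag per token.
tokens = ["我", "想", "订票"]
entity_tags = ["O", "O", "O"]

rows = sequence_features_with_cls(tokens)

# With the CLS row included, the per-token alignment fails (4 vs 3).
try:
    check_alignment(rows, entity_tags)
    mismatch_message = ""
except ValueError as err:
    mismatch_message = str(err)

# Dropping the CLS row makes both sequences the same length again.
aligned_rows = rows[1:]
check_alignment(aligned_rows, entity_tags)
```

Whether the extra row should be dropped on the feature side or accounted for on the tag side depends on the actual pipeline; the sketch only shows where the off-by-one comes from.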
When I remove the LanguageModelFeaturizer block, the issue disappears. I guess this is because jieba and LanguageModelFeaturizer are incompatible, although in certain versions they are compatible, haha.
Maxime Verger commented: 💡 Heads up! We're moving issues to Jira: https://rasa-open-source.atlassian.net/browse/OSS. From now on, this Jira board is the place where you can browse (without an account) and create issues (you'll need a free Jira account for that). This GitHub issue has already been migrated to Jira and will be closed on January 9th, 2023. Do not forget to subscribe to the corresponding Jira issue! ➡️ More information in the forum: https://forum.rasa.com/t/migration-of-rasa-oss-issues-to-jira/56569.
Rasa Open Source version
3.0.3
Rasa SDK version
3.0.2
Rasa X version
None
Python version
3.8
What operating system are you using?
OSX
What happened?
Hello Rasa team, I ran into the problem shown in the log while developing a Chinese conversational bot. It looks like there is a misalignment between LanguageModelFeaturizer and DIETClassifier when the language is Chinese.
The pipeline of my config file is
Command / Request
Relevant log output