Label extraction only for english. #3

Open
schmarion opened this issue May 3, 2023 · 1 comment

Comments

@schmarion

Hi,
I am using chat-intents and the clustering works very well.
However, I am working with French data and the label extraction gives poor results. I assume this is because the method always loads an English-specific spaCy model.
I was wondering whether the name of the spaCy model to load, or at least the language, could be passed as a parameter, for example to apply_and_summarize_labels?
This way, the performance could be much better for all languages other than English.
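
To make the request concrete, here is a rough sketch of what a configurable model could look like. It is not the actual chat-intents code: `load_pipeline` and `extract_label` are hypothetical names, and the labeling logic (most frequent verb plus most frequent nouns) is a simplified stand-in. The model names `en_core_web_sm` and `fr_core_news_sm` are standard spaCy packages that have to be installed separately. Since dependency labels can differ between spaCy models, the sketch relies only on the coarse `pos_` tags.

```python
from collections import Counter
from typing import Callable, Iterable


def load_pipeline(model_name: str = "en_core_web_sm"):
    """Load a spaCy pipeline by name, e.g. "fr_core_news_sm" for French."""
    import spacy  # imported lazily; only needed when a real model is loaded

    return spacy.load(model_name)


def extract_label(docs: Iterable[str], nlp: Callable, n_nouns: int = 2) -> str:
    """Build a cluster label from the most frequent verb and nouns.

    `nlp` is any callable that maps a string to tokens exposing
    `lemma_`, `pos_` and `is_stop` (a loaded spaCy pipeline does).
    This is a hypothetical helper, not the library's own method.
    """
    verbs: Counter = Counter()
    nouns: Counter = Counter()
    for text in docs:
        for token in nlp(text):
            if token.is_stop:
                continue
            lemma = token.lemma_.lower()
            if token.pos_ == "VERB":
                verbs[lemma] += 1
            elif token.pos_ == "NOUN":
                nouns[lemma] += 1
    # One verb followed by up to n_nouns nouns, joined with underscores.
    parts = [word for word, _ in verbs.most_common(1)]
    parts += [word for word, _ in nouns.most_common(n_nouns)]
    return "_".join(parts)


# Example usage (assumes the French model is installed):
#   nlp = load_pipeline("fr_core_news_sm")
#   label = extract_label(cluster_texts, nlp)
```

Passing the loaded pipeline in, rather than loading a fixed model inside the function, keeps the language choice with the caller and leaves English as the default.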

@dborrelli
Owner

Hi,
Interesting suggestion! Adding the language model as a parameter should be straightforward, but I'd want to ensure the approach still works well with other languages and I'm a bit bandwidth-limited at the moment. Happy to consider a PR though if you have something working.
