Boosting exact matches #110

Closed · rabberbock opened this issue Jan 19, 2017 · 11 comments

rabberbock commented Jan 19, 2017

Thanks so much for such an awesome project! I am really enjoying testing it out.

One thing I noticed is that an exact match does not have a confidence of 1.0; sometimes it may even have a low confidence. What would your approach be to boosting exact matches? Obviously, we can just search for an exact match, but I was wondering whether there is a way to improve the score in the model itself.

Any help would be much appreciated. Thanks again for the awesome work!

amn41 (Contributor) commented Jan 22, 2017

I'm glad you're excited about this project. If I understand correctly, it sounds like you want to overfit to your training data :) which would probably hurt the model's ability to generalise to new examples.
I'm open to making the scores more informative/useful though, so please provide some concrete examples and I'll be happy to look at them.

amn41 closed this as completed Jan 23, 2017
rabberbock (Author) commented Jan 25, 2017

@amn41
You got it right; I guess I am trying to overfit my training data.
I do not really understand what the score means, though. Is it how likely it is that the text was mapped to the correct intent? And would this score grow with more training data? Does that make sense?

Also, the type of examples I was trying to train the model on is similar to the following:
What is task one?
What is task two?
Tell me about task one
Tell me about task two
Each of these intents has examples such as "How do I clean my dishwasher?" and other variants. I know this may not be the best way of doing it; maybe an entity approach would be better for that type of question. Is that what you would suggest? Would the model be inaccurate if I had many examples of "what is task X" or "tell me about task one" for each intent?

Thanks,

Raffi

amn41 (Contributor) commented Jan 25, 2017

If there's really only one word difference, I would recommend using one intent, call it ask_about_task or something, and have the task type as an entity :)
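
For illustration, here is a minimal sketch of what a single training example could look like with the task name annotated as an entity, assuming the rasa_nlu JSON training-data format; the intent name `ask_about_task`, the entity name `task`, and the character offsets are just placeholders for this example.

```python
# Hypothetical sketch: one training example with the task name as an
# entity, rather than a separate intent per task. Names are illustrative.
import json

training_data = {
    "rasa_nlu_data": {
        "common_examples": [
            {
                "text": "tell me about task one",
                "intent": "ask_about_task",   # one intent for all task questions
                "entities": [
                    {
                        "start": 14,          # character span of "task one"
                        "end": 22,
                        "value": "task one",
                        "entity": "task"
                    }
                ]
            }
        ]
    }
}

print(json.dumps(training_data, indent=2))
```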

Regarding the confidence, it's the estimated probability that the text matches the predicted intent. So if you have two intents and 51% confidence, that's pretty weak. If you have 50 intents and 25% confidence, that's actually quite high, if that makes sense!
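
As a quick illustrative comparison against the uniform chance baseline (the helper function and the numbers below are made up for this example, not part of rasa):

```python
# Illustrative sketch: compare a reported confidence against the uniform
# chance baseline of 1 / number_of_intents.
def lift_over_chance(confidence, num_intents):
    baseline = 1.0 / num_intents       # what uniform random guessing would score
    return confidence / baseline

print(lift_over_chance(0.51, 2))    # 1.02 -> barely better than guessing
print(lift_over_chance(0.25, 50))   # 12.5 -> well above chance
```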

rabberbock (Author) commented:

Thanks! That makes sense. However, perhaps I am training wrong. I tried the same dataset with both mitie and spacy_sklearn, and the confidence for mitie is much higher (an intent with spacy_sklearn would be 0.3, while with mitie it is 0.9 and above). Is there a reason for this? My mitie training took a little over an hour on a 16 GB MacBook Pro. With spacy_sklearn, the same training takes about 20 seconds. There were 19 intents with 85 utterances.

Any insights into this would be much appreciated! Thanks!

amn41 (Contributor) commented Jan 31, 2017

The confidence you get from MITIE is a score, not a true probability, so I wouldn't read too much into the exact numbers. Training can take a while with MITIE because it tries too hard to optimise some parameters. You can use my fork, which trains quicker.

rabberbock (Author) commented:

Thanks so much for taking the time to respond! I think I am actually using your fork already, because you had mentioned it in another issue. Even with your fork, does it make sense that training with mitie could take over 6 hours for 19 intents with 115 utterances? spacy_sklearn takes just 30 seconds on the same dataset.

Regarding the score, a lowish score with mitie still means that the probability of mapping to the correct intent is low, correct? To me the mitie score just seems to make more sense than the probability in spacy_sklearn. When I see a score above 0.9 in mitie, that sounds pretty good to me. Does mitie do better on a smaller dataset than spacy_sklearn? With spacy_sklearn, there seemed to be little difference in the probability of a correct and an incorrect mapping of an intent. I also thought mitie did a better job with utterances that had overlapping words.

With spacy_sklearn, during training I get "F-score is ill-defined and being set to 0.0 due to no predicted samples" once or twice, but I do not know which utterances it is having trouble with. Is there a reason why I would get this error?

Thanks again! Any help would be much appreciated!

amn41 (Contributor) commented Jan 31, 2017

Re: training times - I think I'll bring the mitie_sklearn backend up to date to resolve this.

The warning about the f-score is probably the following: when training, rasa randomly splits the data into a training set (80%) and a test set (20%). If you have very few examples of one intent, it can be that the test set doesn't contain any examples of that intent, which generates this warning.
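
For context, here is a toy scikit-learn sketch (the labels are made up, and this is not the actual rasa training code) showing the general mechanism behind that warning: a label that never appears among the predictions has an undefined F-score, which sklearn sets to 0.0 and reports with a warning.

```python
# Toy reproduction of the "F-score is ill-defined" warning: class 2 occurs
# in the true labels but is never predicted, so its F-score is undefined
# and scikit-learn sets it to 0.0, emitting an UndefinedMetricWarning.
from sklearn.metrics import f1_score

y_true = [0, 1, 2, 0, 1]
y_pred = [0, 1, 1, 0, 1]   # class 2 never predicted

print(f1_score(y_true, y_pred, average="macro"))
```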

rabberbock (Author) commented:

Cool, that sounds great! Is mitie_sklearn something that I could even use now, or is it still a work in progress? I thought there were only two out-of-the-box choices, mitie or spacy_sklearn.

amn41 (Contributor) commented Jan 31, 2017

mitie_sklearn is kind of a hybrid, using MITIE's word vectors and named entity recogniser, but sklearn to train the intent classifier. It works (I've used it before) but might not be fully up to date. Try it out and let me know if you get any errors.

rabberbock (Author) commented Jan 31, 2017

Thanks for the information!

I see the training and interpreter classes for mitie_sklearn, but I think the trainer is missing the sklearn_text_categorizer_trainer method/class (line 47), which I cannot seem to locate.

Thanks again for all your help!

amn41 (Contributor) commented Feb 1, 2017

Ok yeah it's out of date. I'll find some time to fix this :)
