Fix OSS-413: Proper intents in interactive training #12722

ottonemo · 2023-08-09T16:19:54Z

In OSS-413 a user reports that since Rasa 3.1 the interactive training dialogue suggests malformed/shortened intent names, e.g.:

a
b
c

This is due to a bug in the parsing of intent names which assumes that every intent is a dictionary (which it is ONLY when a property such as use_entities is set).
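
A minimal sketch of how this kind of shortening can arise, assuming the domain's intents list mixes plain strings with single-key dictionaries. The function names are hypothetical and this is not the actual code from interactive.py; it only illustrates that a dict-style lookup applied to a string yields its first character.

```python
from typing import Any, Dict, List, Union

# An intent entry is a plain name, or a one-key dict when properties
# such as use_entities are set (assumed shape of the domain's intents list).
IntentEntry = Union[str, Dict[str, Any]]


def names_assuming_dicts(intents: List[IntentEntry]) -> List[str]:
    """Hypothetical buggy variant: treats every entry as a dict."""
    # Iterating a dict yields its keys, but iterating a str yields characters,
    # so a plain "greet" entry is shortened to "g".
    return [next(iter(entry)) for entry in intents]


def names_by_type(intents: List[IntentEntry]) -> List[str]:
    """Hypothetical fixed variant: distinguishes dict and str entries."""
    names: List[str] = []
    for entry in intents:
        if isinstance(entry, dict):
            names.extend(entry.keys())
        else:
            names.append(entry)
    return names


intents = ["greet", {"inform": {"use_entities": ["city"]}}, "goodbye"]
print(names_assuming_dicts(intents))  # ['g', 'inform', 'g']
print(names_by_type(intents))         # ['greet', 'inform', 'goodbye']
```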

While OSS-413 states that this is solely cosmetic, it is an actual bug that caused severe problems and cost several hours to debug. When using the regex matcher, the parsed intent is checked against the domain. If it is not found there, the check fails and an attempt is made to revert the user utterance. The user utterance is then written to the tracker, but SlotSet events are not repeated, so any form validator will fail for inexplicable reasons.
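
To illustrate why the shortened names are more than cosmetic, here is a rough sketch of the membership check described above. The helper name and the shape of the parse data are assumptions for illustration, not Rasa's actual implementation.

```python
from typing import Any, Dict, List


def forced_intent_in_domain(parse_data: Dict[str, Any], known: List[str]) -> bool:
    """Hypothetical check: is the intent of a forced utterance a known name?"""
    intent = parse_data.get("intent", {}).get("name")
    return intent in known


# Assumed parse result for a forced utterance such as "/greet".
parse_data = {"text": "/greet", "intent": {"name": "greet"}}

shortened = ["g", "inform"]    # names as produced by the faulty parsing
correct = ["greet", "inform"]  # names as defined in the domain

# With shortened names the check fails, which is the case that leads to the revert.
print(forced_intent_in_domain(parse_data, shortened))  # False
# With the full names the same utterance passes the check.
print(forced_intent_in_domain(parse_data, correct))    # True
```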

This also means that there is a bug in _correct_wrong_nlu, which either does not copy enough or does not re-start the ActionExtractSlots action. That bug is not addressed here.

This behavior is rather hard to test; if you have any suggestions, feel free to guide me.

Proposed changes:

Make a distinction between dict and str intent names from the domain

Status (please check what you already did):

added some tests for the functionality
updated the documentation
updated the changelog (please check changelog for instructions)
reformat files using black (please check Readme for instructions)

ancalita

@ottonemo Great, thank you for your contribution tackling this bug!
Before merging there are a few things we need to do:

please add a changelog entry here to describe both the bug and bugfix
please share some assistant example to reproduce the bug manually and confirm the bugfix you implemented fixes this
please open a separate ticket for the other issue you flagged: there is a bug in the _correct_wrong_nlu for not copying enough or re-starting the ActionExtractSlots action
since this is a bugfix it should target branch 3.6.x, please rebase, this way it can be released in a micro patch.

Could we add a unit test for record_messages? It will involve a lot of mocking and monkeypatching of most of the functions used here, apart from _validate_nlu, which uses the intents list you modified. Even there you will have to monkeypatch questionary.confirm to either print the message (in which case it's advisable to use the capsys fixture from pytest) or return the message so it can be asserted directly 🙏🏻
Let me know in case of any questions!

rasa/core/training/interactive.py

ottonemo · 2023-08-18T11:30:58Z

Hi, thanks for the comments!

please add a changelog entry here to describe both the bug and bugfix

Done.

please share some assistant example to reproduce the bug manually and confirm the bugfix you implemented fixes this

Sure! Luckily the moodbot example is sufficient. Simply run rasa interactive from, for example, the 3.6.x branch:

Since I maintain an internal middleware for Rasa that needed to deal with this issue I can confirm that the fix I submitted resolves the issue. However, just to be certain, here's a screenshot from the OSS-413 branch:

since this is a bugfix it should target branch 3.6.x, please rebase, this way it can be released in a micro patch.

I rebased the commits and the PR - I hope it is correct now.

Could we add a unit test for record_messages? It will involve a lot of mocking and monkeypatching of most of the functions used here, apart from _validate_nlu, which uses the intents list you modified. Even there you will have to monkeypatch questionary.confirm to either print the message (in which case it's advisable to use the capsys fixture from pytest) or return the message so it can be asserted directly 🙏🏻

That sounds very complicated and not very maintainable. What about refactoring the intent retrieval into a separate function and testing it for the two cases? This way we would only need to provide domains (i.e. from Domain.from_yaml().as_dict()).
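
The proposed test could look roughly like the sketch below. It uses plain dictionaries in place of the output of Domain.from_yaml().as_dict(), and get_intent_names is a hypothetical name for the extracted helper, not necessarily the one used in the PR.

```python
from typing import Any, Dict, List


def get_intent_names(domain: Dict[str, Any]) -> List[str]:
    """Hypothetical extracted helper returning intent names from a domain dict."""
    names: List[str] = []
    for entry in domain.get("intents", []):
        # Entries with attributes are one-key dicts; plain entries are strings.
        names.append(next(iter(entry)) if isinstance(entry, dict) else entry)
    return names


def test_intents_without_attributes() -> None:
    domain = {"intents": ["greet", "goodbye"]}
    assert get_intent_names(domain) == ["greet", "goodbye"]


def test_intents_with_attributes() -> None:
    domain = {"intents": ["greet", {"inform": {"use_entities": []}}]}
    assert get_intent_names(domain) == ["greet", "inform"]


if __name__ == "__main__":
    test_intents_without_attributes()
    test_intents_with_attributes()
```

Keeping the helper free of tracker and prompt handling is what lets the two cases be covered without mocking.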

NitinKumar94 · 2023-09-11T22:42:23Z

I would love to see this PR merged! I was wondering why rasa interactive did not display the correct intent names, and this does fix it! Bumping this thread so that it can be reviewed.

ancalita

Thanks for addressing my review comments 🚀

I added one more request for the changelog. Also, please fix the formatting so that the quality check in the CI passes (it's a required step). You will need to run make format locally where you installed 3.6.x, then push the changes. I also recommend running make lint and make types, and fixing any issues that might be flagged.

What about refactoring the intent retrieval into a separate function and testing it for the two cases?

Yes, great idea, please go ahead and add this unit test.

ancalita · 2023-09-13T10:48:40Z

changelog/12722.bugfix.md

+This will also fix a bug where forced user utterances (using the regex matcher) will
+be reverted even though they are present in the domain.


I would add the information you gave in the PR description, as that was clearer than these 2 lines 🙏🏻

In OSS-413 a user reports that since Rasa 3.1 the interactive training dialogue suggests malformed/shortened intent names, e.g.:

- a
- b
- c

This is due to a bug in the parsing of intent names which assumes that every intent is a dictionary (which it is ONLY when a property such as `use_entities` is set).

While OSS-413 states that this is solely cosmetic it is an actual bug that caused severe problems that cost several hours to debug: When using the regex matcher it is checked whether the parsed intent is in the domain or not. When it is not, it will fail and attempt to revert the user utterance. The user utterance is then written to the tracker but `SlotSet` events are not repeated - therefore any form validator will fail for inexplicable reasons.

This also means that there is a bug in the `_correct_wrong_nlu` for not copying enough or re-starting the `ActionExtractSlots` action which is not addressed here.

Since the main problem of bug OSS-413 is that intents with attributes are not retrieved well the test was implemented to use domains with and without intent definitions using attributes.

ottonemo · 2023-09-26T17:12:46Z

@ancalita I added the test cases and ran linting, formatting and type checkers. Please review again!

ancalita

Excellent, thanks @ottonemo ⭐

ottonemo requested a review from a team as a code owner August 9, 2023 16:19

ancalita reviewed Aug 15, 2023

ottonemo force-pushed the issue/OS-413 branch from 2b468c3 to 8327a59 August 18, 2023 11:07

ottonemo changed the base branch from main to 3.6.x August 18, 2023 11:08

ancalita self-requested a review September 13, 2023 10:44

ancalita reviewed Sep 13, 2023

ottonemo added 5 commits September 26, 2023 19:11

Formatting

3975fa7

Address review comment: add comment

ba5f66d

Review: add changelog

b0a7c6d

Refactor and test function to retrieve intent names

ee1dd88

Since the main problem of bug OSS-413 is that intents with attributes are not retrieved well the test was implemented to use domains with and without intent definitions using attributes.

ottonemo force-pushed the issue/OS-413 branch from c1f2793 to ee1dd88 September 26, 2023 17:11

ottonemo requested a review from ancalita September 26, 2023 17:52

ancalita approved these changes Sep 27, 2023

ancalita merged commit e37774e into RasaHQ:3.6.x Sep 27, 2023
95 of 96 checks passed
