Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deprecate inline lookup tables #4871

Merged
merged 13 commits into from Dec 11, 2019
Merged

Conversation

wochinge
Copy link
Contributor

@wochinge wochinge commented Nov 28, 2019

Proposed changes:

Status (please check what you already did):

  • added some tests for the functionality
  • updated the documentation
  • updated the changelog (please check changelog for instructions)
  • reformat files using black (please check Readme for instructions)

Rasa X does not support inline lookup tables. To be consistent these are also deprecated in ``rasa``.
@wochinge
Copy link
Contributor Author

wait, I should also update the docs. Gonna do that in a sec.

Copy link
Contributor

@federicotdn federicotdn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!! This simplifies our training data format.

@@ -67,7 +62,9 @@ learn the domain with fewer examples and also help it be more confident of its p
Synonyms will map extracted entities to the same name, for example mapping "my savings account" to simply "savings".
However, this only happens *after* the entities have been extracted, so you need to provide examples with the synonyms present so that Rasa can learn to pick them up.

Lookup tables may be specified either directly as lists or as txt files containing newline-separated words or phrases. Upon loading the training data, these files are used to generate case-insensitive regex patterns that are added to the regex features. For example, in this case a list of currency names is supplied so that it is easier to pick out this entity.
Lookup tables may be specified as txt files containing newline-separated words or
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would change txt for plain text

docs/nlu/training-data-format.rst Outdated Show resolved Hide resolved
@tmbo
Copy link
Member

tmbo commented Dec 10, 2019

I guess this should go into the minor next week as well?

@wochinge
Copy link
Contributor Author

yes

@tmbo tmbo added this to the Rasa 1.6 milestone Dec 10, 2019
@wochinge wochinge merged commit 322da75 into master Dec 11, 2019
@wochinge wochinge deleted the deprecate-inline-lookup-tables branch December 11, 2019 17:38
@cristianmtr
Copy link
Contributor

So how exactly should these look like now? Also, do you support multiple files per lookup?

@wochinge
Copy link
Contributor Author

wochinge commented Apr 6, 2020

Hi @cristianmtr ,

multiple files per lookup currently don't work. The lookup files should be provided as described in the documentation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants