New training data format (Initial Implementation) #5970

Closed
degiz opened this issue Jun 8, 2020 · 0 comments
Labels: area:rasa-oss 🎡 (anything related to the open source Rasa framework), type:enhancement ✨ (additions of new features or changes to existing ones, should be doable in a single PR)

degiz commented Jun 8, 2020

Umbrella ticket for the initial implementation of the New Training Data Format.

The latest format examples are in Notion.

We can logically split the work into three parts:

  1. NLU: needs a new YAML parser and support for separating lookup tables. Issue: New Training Data Format - NLU #5983
  2. Stories: needs a new YAML parser and support for Rules. Issue: New Training Data Format - Stories #5996
  3. Domain: add support for splitting the file. Issue: New Training Data Format - Domain #5984

For the initial implementation, IMO it's enough to implement only the read functionality for the parsers; dump can be done later.
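To make the read-first idea concrete, here is a rough sketch of a parser base class where reading is required and dumping is explicitly deferred. Everything in it is an assumption for illustration: the class names `TrainingDataParser` and `NLUParser`, the top-level `nlu` key, and the use of `json.loads` as a stand-in loader (so the sketch runs on the standard library alone; a real implementation would plug in a YAML loader instead). It is not the actual Rasa parser API.

```python
import json
from abc import ABC, abstractmethod
from pathlib import Path
from typing import Any, Callable, Dict


class TrainingDataParser(ABC):
    """Hypothetical base class: read is implemented now, dump comes later."""

    def __init__(self, load: Callable[[str], Dict[str, Any]]) -> None:
        # `load` turns raw text into a mapping (e.g. a YAML safe-load function).
        self._load = load

    def read(self, path: str) -> Dict[str, Any]:
        """Read and validate a training data file."""
        return self.reads(Path(path).read_text(encoding="utf-8"))

    def reads(self, text: str) -> Dict[str, Any]:
        """Parse and validate training data given as a string."""
        return self.validate(self._load(text))

    @abstractmethod
    def validate(self, data: Dict[str, Any]) -> Dict[str, Any]:
        """Check that the parsed mapping has the expected shape."""

    def dumps(self, data: Dict[str, Any]) -> str:
        # Deliberately unimplemented in the initial version.
        raise NotImplementedError("dump support is planned for a later step")


class NLUParser(TrainingDataParser):
    """Hypothetical NLU parser; the `nlu` key is an assumed section name."""

    def validate(self, data: Dict[str, Any]) -> Dict[str, Any]:
        if "nlu" not in data:
            raise ValueError("expected a top-level 'nlu' section")
        return data


# json.loads is only a stand-in for a YAML loader here.
parser = NLUParser(json.loads)
nlu_data = parser.reads('{"nlu": [{"intent": "greet", "examples": ["hi"]}]}')
```

Keeping `dumps` on the interface but raising `NotImplementedError` lets callers and tests be written against the final shape of the class while the write path is still missing.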

NLU and Stories still use the --data parameter, and --domain should accept a folder name.
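One way --domain could accept a folder is to collect every YAML file under it and merge the parsed parts into a single domain mapping. The sketch below only illustrates that idea: the helper names `collect_domain_files` and `merge_domain_parts`, the merge rules (lists are unioned, mappings are updated), and the example keys are all assumptions, not the behaviour Rasa ended up with. Parsing each file is left to whichever YAML loader is used.

```python
from pathlib import Path
from typing import Any, Dict, Iterable, List

YAML_SUFFIXES = {".yml", ".yaml"}


def collect_domain_files(path: str) -> List[Path]:
    """Return the domain files behind `path`, which may be a file or a folder."""
    root = Path(path)
    if root.is_file():
        return [root]
    if root.is_dir():
        # Sort so that the merge order is deterministic.
        return sorted(
            p for p in root.rglob("*") if p.is_file() and p.suffix in YAML_SUFFIXES
        )
    raise FileNotFoundError(f"No domain file or folder at '{path}'")


def merge_domain_parts(parts: Iterable[Dict[str, Any]]) -> Dict[str, Any]:
    """Merge already-parsed domain fragments into one mapping."""
    merged: Dict[str, Any] = {}
    for part in parts:
        for key, value in part.items():
            current = merged.get(key)
            if isinstance(current, list) and isinstance(value, list):
                # Union of list entries, keeping first-seen order.
                for item in value:
                    if item not in current:
                        current.append(item)
            elif isinstance(current, dict) and isinstance(value, dict):
                current.update(value)
            elif isinstance(value, list):
                merged[key] = list(value)
            elif isinstance(value, dict):
                merged[key] = dict(value)
            else:
                # Scalars: the later file wins.
                merged[key] = value
    return merged


# Example with two hypothetical fragments, as if parsed from two files.
domain = merge_domain_parts(
    [
        {"intents": ["greet"], "responses": {"utter_greet": [{"text": "Hi"}]}},
        {"intents": ["greet", "bye"], "responses": {"utter_bye": [{"text": "Bye"}]}},
    ]
)
```

With this split, passing a single file and passing a folder go through the same code path, so existing single-file domains would keep working.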

Later we can decide if we want to allow keys from the domain (e.g. intents) to be mixed with, for example, NLU data.

Later we should also implement a converter from the existing format to the new one.

To a certain extent the work can be parallelized, so each point above can be implemented separately.

@degiz degiz added type:enhancement ✨ Additions of new features or changes to existing ones, should be doable in a single PR area:rasa-oss 🎡 Anything related to the open source Rasa framework labels Jun 8, 2020
@degiz degiz added this to the 2.0 Rasa Open Source milestone Jun 8, 2020
@degiz degiz self-assigned this Jun 8, 2020
@degiz degiz changed the title New training data format New training data format (Initial Implementation) Jun 10, 2020
@degiz degiz closed this as completed Jul 9, 2020