Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Random seeds and deterministic trainings #779

Closed
adrienball opened this issue Apr 5, 2019 · 1 comment
Closed

Random seeds and deterministic trainings #779

adrienball opened this issue Apr 5, 2019 · 1 comment

Comments

@adrienball
Copy link
Contributor

Description
For now, using the default configuration, calling several times the fit method of SnipsNLUEngine will produce slightly different engines. In many cases, it is useful to have reproducible trainings. The NLU engine configuration contains two random seed parameters at the intent classifier level and slot filler level, which are meant to enable this.

However there are currently two issues related to this:

  • a bug in the scikit-learn library was causing non-deterministic behaviors (see [MRG] Fix SGD non deterministic behavior scikit-learn/scikit-learn#13422). This should be fixed in the next version of scikit-learn, but in the meantime the seeds in Snips NLU do not work properly.
  • the use of random seeds is not very friendly at the moment, as you have to set two distinct parameters in the configuration file. Instead, we could have a single high level parameter which could be either in the configuration, or directly passed to the fit method.
@adrienball
Copy link
Contributor Author

Fixed in #801

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant