Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The training data is now only saved after the whole
repeated_self_training
script has run. There are several changes to thesample_config
that you have to adopt, most importantly the oldtrain_samples_pool_size
is nowsamples_per_model * num_data_models
.samples_per_model
indicates how many samples a model generates, andnum_data_models
of how many of the last models we use data from.The data is now saved under the same name as the model (in the data folder). You can start training with this data if you set
load_initial_data=True
(it should not be required to have the correct amout of data). Otherwise the initial training data is generated by aRandom
model.Please let me know if this works on your system!
(the important commit is only 383431b, i just wanted to merge anyway and use a pull request)