
Fix --output option to dump the model in a specific directory #124

Open
lfoppiano opened this issue Mar 2, 2022 · 3 comments · May be fixed by #133

@lfoppiano
Collaborator

I implemented this some time ago and it worked relatively well for normal models; however, for transformers we realised it was not taken into account. With the new update to TF2 it seems to work fine for sequenceLabelling, except for n-fold cross-validation.

In particular, the problem seems to be that the saving of the model is buried inside the cross-validation loop. In my opinion we should call model.save() after the n-fold cross-validation, which would save either just one model (the best) or all of them (e.g. in the case of an ensemble).

My proposal is to give the wrapper a working/temporary directory and then explicitly save the model by calling model.save() with either the --output path or the default path under data/models.

@lfoppiano
Collaborator Author

My suggestion would be the following:

  • for train we should keep it as it is: the model is saved in the proper directory when calling model.save()
  • for train_eval, in particular n-fold cross-validation, the model would be saved in a temporary directory and moved to the correct directory when calling model.save(). Note that the behaviour here could be to copy just the best model or all the models (e.g. envisioning an ensemble classification approach)
  • model.save() will default to data/xxx/model_name when no parameter is specified

With this approach the wrapper only needs to know the tmp directory, and if something fails DeLFT will not pollute the data/ directory.
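To make the idea concrete, here is a minimal sketch of the temp-directory flow. The function name `train_with_temp_dir` and the `train_fn` callback are hypothetical, not DeLFT API; the point is only the mechanism: train into a temporary directory, then move the artifacts to the `--output` path (or the default under data/models) only if training succeeded.

```python
import os
import shutil
import tempfile

def train_with_temp_dir(train_fn, output_dir=None,
                        default_dir="data/models/model_name"):
    """Train into a temp directory, then move artifacts to the final
    location only on success, so a failed run never pollutes data/."""
    tmp_dir = tempfile.mkdtemp(prefix="delft-train-")
    try:
        # e.g. the n-fold cross-validation writes its fold models here
        train_fn(tmp_dir)
        # --output path if given, otherwise the default under data/models
        target = output_dir or default_dir
        parent = os.path.dirname(target)
        if parent:
            os.makedirs(parent, exist_ok=True)
        # equivalent of calling model.save(target) at the end
        shutil.move(tmp_dir, target)
        return target
    except Exception:
        # clean up the temp directory so nothing half-written survives
        shutil.rmtree(tmp_dir, ignore_errors=True)
        raise
```

The "copy only the best model vs. all fold models" choice from the list above would then live inside `train_fn` (or in a filter step before the move), without the wrapper ever writing directly into data/.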

What do you think?

@lfoppiano lfoppiano linked a pull request Mar 28, 2022 that will close this issue
@lfoppiano lfoppiano self-assigned this Mar 28, 2022
@kermitt2
Owner

To clarify: as far as I know, there is currently no problem with the --output option.

The dir_path parameter of save() in the two wrappers works fine with n-fold usage, both with and without a transformer, in the previous version and the current one. The --output parameter in grobidTagger.py works too; this is the only "application" script using the --output parameter (it can be used to save a Grobid model directly under the Grobid home, with the right compatible Grobid model name).

Using the tmp folder will modify the saving mechanism to address #126, but in my tests the --output option works without it.

@lfoppiano
Collaborator Author

Maybe the current version is good now.

When I opened this issue it was version 0.2.6, I think, and at that time it failed only in the case of SciBERT + 10-fold or training (I can't remember exactly): only the config and preprocessor were saved correctly in the output directory, while the rest was saved in the default directory data/models/....

Since there is a new version, we can close it from my side 🎉
