
Different results from allennlp tune and allennlp retrain with transformers #45

Closed
MagiaSN opened this issue Jun 1, 2021 · 4 comments
Labels
bug Something isn't working


@MagiaSN
Contributor

MagiaSN commented Jun 1, 2021

When I am tuning a transformer model, I get different results from allennlp tune and allennlp retrain with the same hyperparameters.

I found that this is caused by the allennlp.common.cached_transformers module: it constructs the model only in the first trial (which consumes some random numbers) and reuses the cached model in later trials (which consumes none), leaving the RNG in a different state and leading to inconsistent results between tune and retrain.
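The interaction can be illustrated without AllenNLP at all: a memoized constructor that draws random numbers only on a cache miss leaves the global RNG in a different state on subsequent calls, so anything seeded identically afterwards diverges. A minimal stdlib sketch (`build_model` and `_cache` are hypothetical stand-ins for `cached_transformers.get` and its internal cache):

```python
import random

_cache = {}

def build_model(name):
    """Stand-in for cached_transformers.get: construction consumes
    random numbers, but only on a cache miss."""
    if name not in _cache:
        _cache[name] = [random.random() for _ in range(3)]  # "weight init"
    return _cache[name]

def run_trial(name):
    random.seed(0)          # every trial seeds identically
    build_model(name)       # consumes RNG only on the first call
    return random.random()  # e.g. dropout / shuffling after model creation

first = run_trial("bert")   # cache miss: init drew 3 numbers first
second = run_trial("bert")  # cache hit: RNG untouched by init
print(first == second)      # False -> trials are not reproducible
```

The first trial returns the 4th number of the seed-0 stream, while every later trial returns the 1st, which is exactly the tune/retrain mismatch described above.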

@MagiaSN
Contributor Author

MagiaSN commented Jun 1, 2021

#46 fixes this for me, but it has to access AllenNLP's private interfaces; is that acceptable?

@himkt himkt self-assigned this Jun 1, 2021
@himkt
Owner

himkt commented Jun 1, 2021

@MagiaSN Thank you so much for the investigation! I agree we have to fix this.
However, it would be better to clear the cache in Optuna's AllenNLPExecutor, since allennlp-optuna is a thin wrapper around Optuna, and it is AllenNLPExecutor (in Optuna) that invokes the AllenNLP functionality.

Would you mind sending the PR to Optuna instead? I would review it if you could send it.
Could you also share a small reproducible configuration? It would be really helpful.

@MagiaSN
Contributor Author

MagiaSN commented Jun 1, 2021

@himkt Sure, I have opened an issue in Optuna with reproducible scripts (optuna/optuna#2716) and a PR (optuna/optuna#2717).

@MagiaSN
Contributor Author

MagiaSN commented Jun 6, 2021

Since this is fixed in Optuna, I am closing this now :)

@MagiaSN MagiaSN closed this as completed Jun 6, 2021