TensorBoard/Wandb/optuna/raytune integration improvements. #7935

madlag · 2020-10-20T17:30:22Z

Improves TensorBoard logging by grouping train / eval metrics as it is usually done in TensorBoard.
Improves TensorBoard/optuna model hyper-parameters logging.
Improves optuna and Ray/tune integration, and provides model hyper-parameter naming.

Test (and sample code) is provided in test_trainer.TrainerHyperParameterIntegrationTest .

Some more work may be need to harmonize metrics naming for eval / train, as the "eval_" prefix used is not very convenient, using a "eval/" prefix would be more foolproof, and consistent with TensorBoard usage, but it would break quite some code, and so may be done in a separate PR.

…tune support, with minor modifications to trainer core code.

sgugger

I don't think changing the metrics names in the logs is a good idea as there are other components relying on them. Renaming the keys inside the TensorBoardCallback to make tensorboard happier seems fine but outside of it, I would prefer to avoid any breaking change.

src/transformers/testing_utils.py

src/transformers/trainer.py

src/transformers/trainer_utils.py

tests/test_trainer.py

sgugger

Thanks for fixing/making all of this better!

…ce#7935) Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.

…uggingface#7935)" This reverts commit e07eb0d.

madlag requested a review from sgugger October 20, 2020 17:30

Improved TensorBoard and Wandb integration as well as optuna and ray/…

5191210

…tune support, with minor modifications to trainer core code.

madlag force-pushed the optuna_integration_improve branch from 3a7a0ee to 5191210 Compare October 20, 2020 19:35

sgugger reviewed Oct 20, 2020

View reviewed changes

madlag added 3 commits October 21, 2020 11:59

Moving trial information to state.

3179a1f

Style fix in test_trainer.py

1c4d608

Removed function used only once.

d83d80d

madlag force-pushed the optuna_integration_improve branch from e339f01 to 55fa620 Compare October 21, 2020 13:44

Do not rename log keys, do it in integrations.

d57d45f

madlag force-pushed the optuna_integration_improve branch from 55fa620 to d57d45f Compare October 21, 2020 14:40

sgugger approved these changes Oct 21, 2020

View reviewed changes

madlag merged commit e174bfe into huggingface:master Oct 21, 2020

madlag deleted the optuna_integration_improve branch October 21, 2020 15:19

fabiocapsouza added a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020

Revert "TensorBoard/Wandb/optuna/raytune integration improvements. (h…

d24f2a8

…uggingface#7935)" This reverts commit e07eb0d.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TensorBoard/Wandb/optuna/raytune integration improvements. #7935

TensorBoard/Wandb/optuna/raytune integration improvements. #7935

madlag commented Oct 20, 2020

sgugger left a comment

sgugger left a comment

TensorBoard/Wandb/optuna/raytune integration improvements. #7935

TensorBoard/Wandb/optuna/raytune integration improvements. #7935

Conversation

madlag commented Oct 20, 2020

sgugger left a comment

Choose a reason for hiding this comment

sgugger left a comment

Choose a reason for hiding this comment