TensorBoard/Wandb/optuna/raytune integration improvements. #7935

Merged
merged 5 commits into from
Oct 21, 2020

@madlag (Contributor) commented Oct 20, 2020

Improves TensorBoard logging by grouping train and eval metrics, as is conventional in TensorBoard.
Improves model hyper-parameter logging for TensorBoard/optuna.
Improves the optuna and Ray/tune integrations, and provides model hyper-parameter naming.

Tests (and sample code) are provided in test_trainer.TrainerHyperParameterIntegrationTest.

Some more work may be needed to harmonize metric naming between eval and train. The "eval_" prefix currently in use is not very convenient; an "eval/" prefix would be more foolproof and consistent with TensorBoard usage. However, it would break a fair amount of code, so it may be done in a separate PR.
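To make the naming question concrete, here is a minimal sketch of the kind of key rewriting discussed above. The function name `group_metrics` and the sample metric names are hypothetical and chosen for illustration only; this is not the code from the PR. It assumes any key starting with "eval_" is an evaluation metric and every other key is a training metric.

```python
from typing import Dict


def group_metrics(logs: Dict[str, float], eval_prefix: str = "eval_") -> Dict[str, float]:
    """Return a copy of `logs` with keys rewritten as `eval/<name>` or `train/<name>`.

    Hypothetical helper: keys that start with `eval_prefix` go to the `eval/`
    group, all other keys go to the `train/` group. The input is not modified.
    """
    grouped: Dict[str, float] = {}
    for key, value in logs.items():
        if key.startswith(eval_prefix):
            grouped["eval/" + key[len(eval_prefix):]] = value
        else:
            grouped["train/" + key] = value
    return grouped


# Sample log dict with made-up values.
logs = {"loss": 0.42, "learning_rate": 5e-5, "eval_loss": 0.40, "eval_accuracy": 0.91}
print(group_metrics(logs))
```

With slash-separated tags, TensorBoard shows the `train/` and `eval/` scalars in separate collapsible sections, which is the grouping the description refers to.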

@madlag requested a review from sgugger October 20, 2020 17:30
…tune support, with minor modifications to trainer core code.

@sgugger (Collaborator) left a comment

I don't think changing the metric names in the logs is a good idea, as other components rely on them. Renaming the keys inside the TensorBoardCallback to make TensorBoard happier seems fine, but outside of it I would prefer to avoid any breaking change.
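A rough sketch of what this suggestion could look like: the renaming happens only at the point where values are handed to the writer, and the shared logs dict is left untouched for other components. The classes `RecordingWriter` and `GroupingTensorBoardLogger` and the method `on_log` are hypothetical stand-ins, not the actual TensorBoardCallback. The stand-in writer only mirrors the `add_scalar(tag, scalar_value, global_step)` call shape of a TensorBoard summary writer so the example runs without extra dependencies.

```python
from typing import Dict, List, Tuple


class RecordingWriter:
    """Stand-in for a TensorBoard writer; records calls instead of writing event files."""

    def __init__(self) -> None:
        self.scalars: List[Tuple[str, float, int]] = []

    def add_scalar(self, tag: str, scalar_value: float, global_step: int) -> None:
        self.scalars.append((tag, scalar_value, global_step))


class GroupingTensorBoardLogger:
    """Hypothetical callback-like class that renames keys only when writing."""

    def __init__(self, writer: RecordingWriter) -> None:
        self.writer = writer

    def on_log(self, logs: Dict[str, object], step: int) -> None:
        for key, value in logs.items():
            # Skip anything that is not a plain scalar (bool is excluded on purpose).
            if isinstance(value, bool) or not isinstance(value, (int, float)):
                continue
            if key.startswith("eval_"):
                tag = "eval/" + key[len("eval_"):]
            else:
                tag = "train/" + key
            # Only the tag passed to the writer changes; `logs` keeps its original keys.
            self.writer.add_scalar(tag, value, step)


writer = RecordingWriter()
logger = GroupingTensorBoardLogger(writer)
logs = {"loss": 0.42, "eval_loss": 0.40, "note": "not a scalar"}
logger.on_log(logs, step=10)
print(writer.scalars)
print(sorted(logs))
```

Because the rewrite is local to the writer call, code that reads "eval_loss" from the logs elsewhere keeps working unchanged.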

src/transformers/testing_utils.py (resolved)
src/transformers/trainer.py (outdated, resolved)
src/transformers/trainer.py (outdated, resolved)
src/transformers/trainer.py (outdated, resolved)
src/transformers/trainer.py (outdated, resolved)
src/transformers/trainer_utils.py (resolved)
tests/test_trainer.py (outdated, resolved)
tests/test_trainer.py (outdated, resolved)

@sgugger (Collaborator) left a comment

Thanks for fixing/making all of this better!

@madlag merged commit e174bfe into huggingface:master Oct 21, 2020
@madlag deleted the optuna_integration_improve branch October 21, 2020 15:19
fabiocapsouza pushed a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020
…ce#7935)

Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.
fabiocapsouza added a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020