
Get experiment_id from MLFlow only once instead of each training loop. #3394

Merged
5 commits merged into Lightning-AI:master on Sep 9, 2020

Conversation

@patrickorlando (Contributor) commented Sep 8, 2020

What does this PR do?

When using the MLflow logger, the experiment id is retrieved from the MlflowClient every time logger.experiment is accessed. This adds overhead to the training and validation loops, which becomes dramatic when the tracking server is remote.

This PR checks whether the experiment_id has already been retrieved and, if so, skips the call to MLflow.

Fixes #3393
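
For context, a minimal sketch of the caching pattern described above. This is illustrative only, not the actual pytorch_lightning/loggers/mlflow.py implementation: the class below and its constructor arguments are simplified stand-ins, while get_experiment_by_name and create_experiment are real MlflowClient methods.

from mlflow.tracking import MlflowClient


class CachedExperimentLogger:
    # Simplified stand-in for MLFlowLogger: resolve the experiment id once
    # and reuse it, instead of querying the tracking server on every access.

    def __init__(self, experiment_name, tracking_uri=None):
        self._experiment_name = experiment_name
        self._experiment_id = None
        self._mlflow_client = MlflowClient(tracking_uri)

    @property
    def experiment(self):
        # Only hit the MLflow server if the id has not been resolved yet.
        if self._experiment_id is None:
            expt = self._mlflow_client.get_experiment_by_name(self._experiment_name)
            if expt is not None:
                self._experiment_id = expt.experiment_id
            else:
                self._experiment_id = self._mlflow_client.create_experiment(
                    name=self._experiment_name
                )
        return self._mlflow_client

With the id cached, every later access to logger.experiment skips the network round-trip, which is what the test added later in this PR verifies.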

Before submitting

  • Was this discussed/approved via a GitHub issue? (not needed for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together? Otherwise, we ask you to create a separate PR for every change.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?
  • Did you verify new and existing tests pass locally with your changes?
  • If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@mergify mergify bot requested a review from a team September 8, 2020 11:05
@awaelchli added labels: bug (Something isn't working), logger (Related to the Loggers), v1.0 allowed Sep 8, 2020
codecov bot commented Sep 8, 2020

Codecov Report

Merging #3394 into master will decrease coverage by 2%.
The diff coverage is 100%.

@@           Coverage Diff            @@
##           master   #3394     +/-   ##
========================================
- Coverage      85%     83%     -2%     
========================================
  Files          98     102      +4     
  Lines        8072    9159   +1087     
========================================
+ Hits         6897    7611    +714     
- Misses       1175    1548    +373     

@rohitgr7 (Contributor) left a comment

LGTM.

pytorch_lightning/loggers/mlflow.py (outdated review thread, resolved)
@mergify mergify bot requested a review from a team September 8, 2020 17:30
@Borda (Member) left a comment

LGTM

@mergify mergify bot requested a review from a team September 8, 2020 22:16
@Borda added the ready (PRs ready to be merged) label Sep 8, 2020
@awaelchli (Member) commented Sep 9, 2020

@patrickorlando This test fails on master; could you add it to tests/loggers/test_mlflow.py?

from unittest import mock
from mlflow.tracking import MlflowClient
from pytorch_lightning.loggers import MLFlowLogger

def test_mlflow_experiment_created_once(tmpdir):
    logger = MLFlowLogger('test', save_dir=tmpdir)
    get_experiment_name = logger.experiment.get_experiment_by_name
    # wrap the real method so behaviour is unchanged while calls are counted
    with mock.patch.object(MlflowClient, 'get_experiment_by_name', wraps=get_experiment_name) as mocked:
        _ = logger.experiment
        _ = logger.experiment
        _ = logger.experiment
        assert mocked.call_count == 1

Thanks

@awaelchli (Member) left a comment

Requesting a test to make the bugfix complete; see my comment above.

@mergify mergify bot requested a review from a team September 9, 2020 07:20
@patrickorlando (Contributor, Author) commented Sep 9, 2020

@awaelchli I've added the test case, but I had to change

get_experiment_name = logger.experiment.get_experiment_by_name

to

get_experiment_name = logger._mlflow_client.get_experiment_by_name

With the original line, get_experiment_by_name had already been called (accessing logger.experiment triggers it) before the mock was installed, so the test was failing.

I've checked that this test still fails on master (commit d438ad8a8db3e76d3ed4e3c6bc9b91d6b3266b8e):

    def test_mlflow_experiment_id_retrieved_once(tmpdir):
        logger = MLFlowLogger('test', save_dir=tmpdir)
        get_experiment_name = logger._mlflow_client.get_experiment_by_name
        with mock.patch.object(MlflowClient, 'get_experiment_by_name', wraps=get_experiment_name) as mocked:
            _ = logger.experiment
            _ = logger.experiment
            _ = logger.experiment
>           assert mocked.call_count == 1
E           AssertionError: assert 3 == 1
E            +  where 3 = <MagicMock name='get_experiment_by_name' id='140215984767184'>.call_count

tests/loggers/test_mlflow.py:53: AssertionError

@pep8speaks commented Sep 9, 2020

Hello @patrickorlando! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-09-09 09:13:44 UTC

@awaelchli (Member) commented

Oh I see, because I had added it at the top of the file. But this way is better 👍

mergify bot commented Sep 9, 2020

This pull request is now in conflict... :(

@Borda Borda merged commit 656c1af into Lightning-AI:master Sep 9, 2020
Labels
bug (Something isn't working), logger (Related to the Loggers), ready (PRs ready to be merged)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

MLFlow Logger slows training steps dramatically, despite only setting metrics to be logged on epoch
5 participants