Move user to tag #1230

acroz · 2019-05-09T14:20:18Z

What changes are proposed in this pull request?

Following #1188 and further discussion with @aarondav, this PR migrates storage of the user who started a run to tags.

In particular:

The tag mlflow.user has been added and is set on creation of a run.
The user_id field on runs has been marked as deprecated and subject to removal in a future release, as was done previously for source_name, source_type etc.
user_id has been removed as an argument from the Python and R create_run methods (it was not exposed on the Java client createRun method).
Clients continue to pass the user_id attribute, which is read from tags on run creation, for the duration of the deprecation period.
Documentation has been updated to reflect that user_id is deprecated.
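
The tag-based flow in the list above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the helper `build_create_run_payload`, the dict-shaped payload, and the `"unknown"` fallback are all hypothetical. Only the tag key `mlflow.user` and the deprecated `user_id` field come from the description.

```python
# Tag key introduced by this PR; everything else here is illustrative.
MLFLOW_USER_TAG = "mlflow.user"


def build_create_run_payload(experiment_id, tags=None):
    """Build a CreateRun-style payload that carries the user both ways.

    Hypothetical helper: the user is read from the tags and mirrored into
    the deprecated user_id field for the duration of the deprecation period.
    """
    tags = dict(tags or {})
    # Assumed fallback when no user tag was supplied.
    user_id = tags.get(MLFLOW_USER_TAG, "unknown")
    return {
        "experiment_id": experiment_id,
        "user_id": user_id,  # deprecated attribute, still sent for now
        "tags": [{"key": k, "value": v} for k, v in sorted(tags.items())],
    }


payload = build_create_run_payload("0", {MLFLOW_USER_TAG: "alice"})
```

With this shape, callers that only look at tags and callers that still read `user_id` see the same value.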

I've also changed the logic for detecting system username from pwd.getpwuid(os.getuid())[0] to the standard library helper getpass.getuser, which should support Windows systems in most cases.
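
As a rough sketch of the username change (the helper name `get_system_username` and the fallback value are assumptions for illustration, not taken from this PR):

```python
import getpass


def get_system_username(default="unknown"):
    """Return the current login name, or `default` if it cannot be found.

    getpass.getuser() checks the LOGNAME, USER, LNAME and USERNAME
    environment variables before falling back to the pwd module, so it
    typically works on Windows too, where pwd is unavailable.
    """
    try:
        return getpass.getuser()
    except Exception:
        # The exception type raised on failure varies by Python version.
        return default


username = get_system_username()
```

Unlike pwd.getpwuid(os.getuid())[0], this does not require the pwd module to be importable.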

How is this patch tested?

These changes affect a number of existing unit and integration tests, which have been updated to reflect the change.

Release Notes

The run field user_id has been deprecated in favour of the new tag mlflow.user.
user_id has been removed as an argument from the create_run methods of the Python and R clients.

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

I've selected 'no' because the Python fluent interface and the Java client interface are unchanged, and it seems unlikely that users would often be passing the user_id argument to mlflow_start_run in the R client.

What component(s) does this PR affect?

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes


aarondav · 2019-05-09T16:38:42Z

mlflow/protos/service.proto

@@ -454,6 +454,8 @@ message RunInfo {
  optional string experiment_id = 2;

  // User who initiated the run.
+  // This field is deprecated, and will be removed in a future MLflow release.


Maybe include a since/as of MLflow 1.0 in this comment

aarondav · 2019-05-09T16:45:05Z

I think we may want to go ahead and make sqlalchemy_store and file_store auto-upgrade the user attribute into a tag, so that any runs created in MLflow 1.0 will not require migration once we drop the attribute.

This is similar to what we did for RUN_NAME here: https://github.com/mlflow/mlflow/pull/1188/files#diff-73334c51ec814d7e8b40d09d7d828f82L365
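
For illustration only, the auto-upgrade idea could look roughly like the sketch below. The function name `upgrade_user_to_tag` and the dict-based run representation are hypothetical and do not reflect the actual sqlalchemy_store or file_store code.

```python
MLFLOW_USER_TAG = "mlflow.user"


def upgrade_user_to_tag(run_info, tags):
    """Return a copy of `tags` with mlflow.user filled in from user_id.

    `run_info` and `tags` are plain dicts standing in for a stored run;
    an existing mlflow.user tag is never overwritten.
    """
    upgraded = dict(tags)
    user_id = run_info.get("user_id")
    if user_id and MLFLOW_USER_TAG not in upgraded:
        upgraded[MLFLOW_USER_TAG] = user_id
    return upgraded


old_run = {"run_id": "abc", "user_id": "alice"}
new_tags = upgrade_user_to_tag(old_run, {})
```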

aarondav · 2019-05-09T16:50:16Z

mlflow/protos/service.proto

@@ -581,6 +583,8 @@ message CreateRun {
  optional string experiment_id = 1;

  // ID of the user executing the run.
+  // This field is deprecated, and will be removed in a future MLflow release.


jfyi - we are planning on adding a doc page specific to MLflow tags, as opposed to documenting within the REST API docs. I think someone may have accidentally deleted your previous docs, but those will be resurrected there.

acroz · 2019-05-09T16:52:50Z

I think we may want to go ahead and make sqlalchemy_store and file_store auto-upgrade the user attribute into a tag, so that any runs created in MLflow 1.0 will not require migration once we drop the attribute.

Do you mean to set it for new runs or to migrate existing runs?

In this PR, the user ID is being passed to the stores as both an attribute and a tag, so all runs created in MLflow 1.0 onwards would have both.

aarondav · 2019-05-09T16:55:16Z

Hmm, yeah, guess I was thinking for old clients, but you're right. Seems unnecessary.

akshaya-a · 2019-05-09T17:11:34Z

@akarloff @namikhai for awareness - i think we already started ignoring this in our server, thanks for the clarity!

acroz and others added 20 commits May 9, 2019 11:25

Add new tag containing executing username

6268861

Fix test_fluent tests

9c5ab62

Set user tags in projects

58256fc

Update client to read user from tags

536b077

Remove unused imports

14c1744

Update REST tracking tests to pass user as tag

6d4bbc1

Add note that user ID is deprecated

5f006dc

Add note to protos indicating that user_id is deprecated

3a221ad

Update REST docs to indicate that user_id is deprecated

811c7f7

[java] Set user tag and mark userId field as deprecated

e61a7a1

[R] Update client to send user as tag

9b08685

[R] Pass user as tag in tests

1cc3d6d

Update rest store tests

174b049

Update tracking tests

32f078c

Ignore linter warnings caused by test fixtures

a4993ae

Revert "[java] Set user tag and mark userId field as deprecated"

7f427c7

This reverts commit e61a7a1. In the end, it's probably best just not to set any tags - this is the client, not a fluent layer.

Add comment to java source on userId deprecation

d888a22

Fix test on Python 2.7

dedf318

Make the tests Python 2/3 agnostic

884336f

Use getpass to get system username

a709e6c

This will provide username support on Windows as well as UNIX

aarondav reviewed May 9, 2019


Tweak deprecation message

b97104e

aarondav reviewed May 9, 2019


aarondav merged commit f23648f into mlflow:master May 13, 2019

sueann added the rn/breaking-change label (Mention under Breaking Changes in Changelogs) May 30, 2019

avflor pushed a commit to avflor/mlflow that referenced this pull request Aug 22, 2020

Move user attribute to a tag (mlflow#1230)

cfd02ea
