Fix R NA check for databricks-utils.R #8429

jinzhang21 · 2023-05-12T23:28:35Z

Related Issues/PRs

#xxx

What changes are proposed in this pull request?

See title. This is related to R 4.3.0 change

Calling && or || with LHS or (if evaluated) RHS of length greater than one is now always an error, with a report of the form

    'length = 4' in coercion to 'logical(1)'
Environment variable _R_CHECK_LENGTH_1_LOGIC2_ no longer has any effect.

Cited here.

How is this patch tested?

Existing unit/integration tests
New unit/integration tests
Manual tests (describe details, including test results, below)

Does this PR change the documentation?

No. You can skip the rest of this section.
Yes. Make sure the changed pages / sections render correctly in the documentation preview.

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

(Details in 1-2 sentences. You can just refer to another PR with a description if this PR is part of a larger change.)

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Signed-off-by: Jin Zhang <jin.zhang@databricks.com>

mlflow-automation · 2023-05-12T23:28:51Z

Documentation preview for cda9abf will be available here when this CircleCI job completes successfully.

More info

Ignore this comment if this PR does not change the documentation.
It takes a few minutes for the preview to be available.
The preview is updated when a new commit is pushed to this PR.
This comment was created by https://github.com/mlflow/mlflow/actions/runs/5000051036.

harupy · 2023-05-15T00:46:35Z

mlflow/R/mlflow/R/databricks-utils.R

@@ -161,7 +161,7 @@ mlflow_get_run_context.mlflow_databricks_client <- function(client, experiment_i
    } else {
      NA
    }
-    if (!is.na(job_info) && !is.na(job_info$job_id)) {
+    if (!all(is.na(job_info)) && !is.na(job_info$job_id)) {


Can we update the R version to 4.3.0 to make sure this line works fine with it?

mlflow/.github/workflows/r.yml

Lines 39 to 42 in ed44d8e

# Note: the version of R released on 4/21/23, 4.3.0, has build issues with devtools.

# Remove this version pin once issues on dependent packages have been fixed.

with:

r-version: "4.2.3"

There are some failures in tests on time duration output format. I can change the tests but is it desired? @harupy @WeichenXu123

── Failure ('test-tracking-runs.R:206:3'): mlflow_log_metric() rounds step and timestamp inputs ── mlflow_get_metric_history("timestamp_metric")$timestamp (`actual`) and purrr::map(round(timestamp_inputs), mlflow:::milliseconds_to_date) (`expected`) don't have the same values. * Only in `actual`: 1970-01-01 00:01:06.796, 1970-01-01 00:01:29.372, 1970-01-01 00:01:24.502, 1970-01-01 00:01:04.028, 1970-01-01 00:00:17.371, 1970-01-01 00:00:47.979, 1970-01-01 00:00:42.036, 1970-01-01 00:00:15.319, 1970-01-01 00:00:39.544, 1970-01-01 00:01:39.184 * Only in `expected`: 66.796, 89.372, 84.502, 64.028, 17.371, 47.979, 42.036, 15.319, 39.544, 99.184 ── Failure ('test-tracking-runs.R:259:3'): mlflow_log_metric() with step produces expected metric data ── metric_history_1$timestamp (`actual`) and purrr::map(c(300, 100, 200), mlflow:::milliseconds_to_date) (`expected`) don't have the same values. * Only in `actual`: 1970-01-01 00:00:00.3, 1970-01-01 00:00:00.1, 1970-01-01 00:00:00.2 * Only in `expected`: 0.3, 0.1, 0.2 ── Failure ('test-tracking-runs.R:439:3'): mlflow_log_batch() works ──────────── metrics$timestamp (`actual`) and purrr::map(c(200, 300, 400, 500, 600), mlflow:::milliseconds_to_date) (`expected`) don't have the same values. * Only in `actual`: 1970-01-01 00:00:00.4, 1970-01-01 00:00:00.3, 1970-01-01 00:00:00.2, 1970-01-01 00:00:00.5, 1970-01-01 00:00:00.6 * Only in `expected`: 0.2, 0.3, 0.4, 0.5, 0.6 ── Failure ('test-tracking-runs.R:453:3'): mlflow_log_batch() works ──────────── metric_history$timestamp (`actual`) and purrr::map(c(100, 200), mlflow:::milliseconds_to_date) (`expected`) don't have the same values. * Only in `actual`: 1970-01-01 00:00:00.1, 1970-01-01 00:00:00.2 * Only in `expected`: 0.1, 0.2

@jinzhang21 Let's fix them :)

Found R 4.3.0 contains changes related to dates and times: https://cran.r-project.org/doc/manuals/r-release/NEWS.html

harupy

LGTM once #8429 (comment) is addressed :)

Signed-off-by: Jin Zhang <jin.zhang@databricks.com>

Signed-off-by: harupy <hkawamura0130@gmail.com>

harupy · 2023-05-17T06:22:20Z

Merged master to reflect #8446. The R check should pass now.

Fix R NA check for databricks-utils.R

d870503

Signed-off-by: Jin Zhang <jin.zhang@databricks.com>

github-actions bot added language/r R APIs and clients rn/none List under Small Changes in Changelogs. labels May 12, 2023

jinzhang21 requested a review from harupy May 12, 2023 23:28

harupy reviewed May 15, 2023

View reviewed changes

harupy approved these changes May 15, 2023

View reviewed changes

WeichenXu123 approved these changes May 15, 2023

View reviewed changes

Update R workflow

b315fab

Signed-off-by: Jin Zhang <jin.zhang@databricks.com>

harupy mentioned this pull request May 17, 2023

Replace purrr::map with purrr::map_vec #8446

Merged

33 tasks

Merge branch 'master' into pr/jinzhang21/8429

cda9abf

Signed-off-by: harupy <hkawamura0130@gmail.com>

harupy enabled auto-merge (squash) May 17, 2023 10:19

harupy merged commit ea754e8 into mlflow:master May 17, 2023
26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix R NA check for databricks-utils.R #8429

Fix R NA check for databricks-utils.R #8429

jinzhang21 commented May 12, 2023

mlflow-automation commented May 12, 2023 •

edited

harupy May 15, 2023

WeichenXu123 May 15, 2023

jinzhang21 May 15, 2023

harupy May 16, 2023

harupy May 16, 2023 •

edited

harupy left a comment

harupy commented May 17, 2023

	# Note: the version of R released on 4/21/23, 4.3.0, has build issues with devtools.
	# Remove this version pin once issues on dependent packages have been fixed.
	with:
	r-version: "4.2.3"

Fix R NA check for databricks-utils.R #8429

Fix R NA check for databricks-utils.R #8429

Conversation

jinzhang21 commented May 12, 2023

Related Issues/PRs

What changes are proposed in this pull request?

How is this patch tested?

Does this PR change the documentation?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

mlflow-automation commented May 12, 2023 • edited

harupy May 15, 2023

Choose a reason for hiding this comment

WeichenXu123 May 15, 2023

Choose a reason for hiding this comment

jinzhang21 May 15, 2023

Choose a reason for hiding this comment

harupy May 16, 2023

Choose a reason for hiding this comment

harupy May 16, 2023 • edited

Choose a reason for hiding this comment

harupy left a comment

Choose a reason for hiding this comment

harupy commented May 17, 2023

mlflow-automation commented May 12, 2023 •

edited

harupy May 16, 2023 •

edited