Write eval results to delta #10659
Conversation
```python
eval_table_spark.write.mode("overwrite").format("delta").saveAsTable(
    self.eval_results_path
)
```
I think this should be an `append` operation, and we should also enable schema merging by setting the `spark.databricks.delta.schema.autoMerge.enabled` conf.
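A minimal sketch of that suggestion, wrapped in a helper for illustration (the function name and parameter names are assumptions; `eval_table_spark` and the table path mirror the diff context):

```python
def write_eval_results(spark, eval_table_spark, eval_results_path):
    # Sketch of the reviewer's suggestion: enable Delta schema auto-merge,
    # then append to the table rather than overwriting it.
    # `spark` is the active SparkSession; names here are illustrative.
    spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", "true")
    eval_table_spark.write.mode("append").format("delta").saveAsTable(
        eval_results_path
    )
```

With auto-merge enabled, new columns in the eval results (e.g. a newly added metric) are added to the existing Delta table schema instead of failing the write.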
We should definitely automerge the schema.
For append vs overwrite:
- Append is the right thing to do if we add a new experiment to the same eval dataset.
- Overwrite is the right thing to do if the eval dataset has changed since last processed.
Can we differentiate between the two cases when the call is made?
Makes sense! I think we should add an evaluator conf for this. Our RAG solution can decide how to populate it.
Sounds good. Introduced a new evaluator conf `eval_results_mode` to control that.
Signed-off-by: Prithvi Kannan <prithvi.kannan@databricks.com>
This reverts commit c16ed30. Signed-off-by: Prithvi Kannan <prithvi.kannan@databricks.com>
Force-pushed from 41241f3 to 25e311a
```python
shutil.rmtree(tmpdir)


def test_write_to_delta_fails_with_invalid_mode(spark_session_with_tempdir):
```
It seems `spark_session_with_tempdir` is not used in this test?
It's still passed into this test. The Spark session fixture is not tied to the module, so that we can test `test_write_to_delta_fails_without_spark` above, and then in this test Spark is available but the mode is invalid.
```yaml
    pytest tests/evaluate --ignore=tests/evaluate/test_default_evaluator_delta.py
- name: Run tests with delta
  run: |
    pytest tests/evaluate/test_default_evaluator_delta.py
```
Making this change since the Spark sessions from `tests/evaluate/test_evaluation` and `tests/evaluate/test_default_evaluator` are not fully isolated (i.e. calling `getOrCreate()` in another file will pick up the Spark session from these files). For `tests/evaluate/test_default_evaluator_delta.py` we need to test both the case with no Spark session available and the case with a Spark session that has the Delta extension.
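A sketch of what such a Delta-enabled session fixture could look like (the configuration keys are the standard Delta Lake ones; the fixture name, scope, and return shape are assumptions, and running it requires `pyspark` with the Delta jars on the classpath):

```python
import pytest
from pyspark.sql import SparkSession


@pytest.fixture
def spark_session_with_tempdir(tmp_path):
    # Hypothetical fixture: a local SparkSession with the Delta Lake
    # extension enabled, warehoused in a pytest temp directory so the
    # session (and its tables) stay isolated from other test modules.
    spark = (
        SparkSession.builder.master("local[1]")
        .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
        .config(
            "spark.sql.catalog.spark_catalog",
            "org.apache.spark.sql.delta.catalog.DeltaCatalog",
        )
        .config("spark.sql.warehouse.dir", str(tmp_path))
        .getOrCreate()
    )
    yield spark, tmp_path
    spark.stop()
```

Stopping the session in the fixture teardown is what keeps a later `getOrCreate()` elsewhere from silently reusing this Delta-configured session.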
How are we testing this both with and without a Spark session? Are we ignoring it in the previous run? Let me know if I'm missing something.
@sunishsheth2009, `test_default_evaluator_delta` has a different fixture that creates a Spark session with Delta. We have one test with the fixture and one without.
```python
        "mergeSchema", "true"
    ).format("delta").saveAsTable(self.eval_results_path)
except Exception as e:
    _logger.info(f"Saving eval table to delta table failed. Reason: {e}")
```
Should we change this to `_logger.warning` instead?
I don't feel strongly either way, but figured that this might help with debugging during development.