feat: get diff between latest recommendation and cluster schema [PROD-1686] #103

eheinlein-sync · 2024-03-04T17:44:20Z

Summary

We're adding functionality to display cluster recommendations, as well as a side-by-side comparison and a merged cluster definition.

Checklist

Before formally opening this PR, please adhere to the following standards:

Branch/PR names begin with the related Jira ticket ID (e.g. PROD-31) for Jira integration
File names are lower_snake_case
Relevant unit tests have been added or not applicable
Relevant documentation has been added or not applicable
Mark yourself as the assignee (makes it easier to scan the PR list)

Related Jira Ticket (PROD-1686)

Add any relevant testing examples or screenshots.

singhals · 2024-03-09T16:56:53Z

sync/models.py

+    spark_conf: Dict
+    spark_version: str
+    runtime_engine: str
+    aws_attributes: Dict


aws_attributes in the Azure config?

singhals · 2024-03-11T01:44:36Z

sync/api/projects.py

+    response_str = json.dumps(recommendation_response.result)
+    return Response(
+        result={
+            "cluster_recommendations": json.loads(response_str),


It's one recommendation, no? "cluster_recommendations" -> "cluster_recommendation"?

singhals · 2024-03-11T01:48:15Z

sync/api/projects.py

+def get_updated_cluster_defintion(
+    project_id: str, cluster_spec_str: str
+) -> Response[Union[AWSProjectConfiguration, AzureProjectConfiguration]]:
+    """Print Cluster Definition merged with Project Configuration Recommendations.


I think you meant "Return" and not "Print" here

singhals · 2024-03-11T01:55:22Z

sync/api/projects.py

+        # Convert Response result object to str
+        latest_rec_str = json.dumps(rec_response.result)
+        # Convert json string to json
+        latest_recommendation = json.loads(latest_rec_str)
+        cluster_definition = json.loads(cluster_spec_str)
+        for key in latest_recommendation.keys():
+            cluster_definition[key] = latest_recommendation[key]
+
+        # instance_source and driver_instance_source are not
+        # included in recommendation and need to be updated as well
+        driver_recommendation = cluster_definition["node_type_id"]
+        cluster_definition["instance_source"] = {"node_type_id": driver_recommendation}
+        cluster_definition["driver_instance_source"] = {"node_type_id": driver_recommendation}


We should do a deep merge here like we do when we apply a recommendation from the library:

syncsparkpy/sync/_databricks.py

Line 633 in dc7a217

def get_recommendation_cluster(

It might be good to review this pathway before merging:

syncsparkpy/sync/_databricks.py

Line 524 in dc7a217

def apply_project_recommendation(

I think we are rewriting logic that already exists.
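
For reference, the difference between the shallow key-by-key overwrite in the hunk above and a deep merge can be sketched as follows. This is a minimal illustration, not the helper in `sync/_databricks.py`: the function name `deep_merge`, its signature, and the sample cluster values are assumptions made for the example.

```python
from copy import deepcopy
from typing import Any, Dict


def deep_merge(base: Dict[str, Any], override: Dict[str, Any]) -> Dict[str, Any]:
    """Return a new dict with `override` recursively merged into `base`.

    Hypothetical helper for illustration; neither input is mutated.
    """
    merged = deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Both sides are dicts: merge nested keys instead of replacing the whole dict.
            merged[key] = deep_merge(merged[key], value)
        else:
            # Scalars, lists, and type mismatches: the override value wins.
            merged[key] = deepcopy(value)
    return merged


# Made-up sample data, shaped loosely like a cluster definition.
cluster_definition = {
    "node_type_id": "m5.xlarge",
    "aws_attributes": {"availability": "SPOT", "zone_id": "us-east-1a"},
}
recommendation = {
    "node_type_id": "m5.2xlarge",
    "aws_attributes": {"availability": "ON_DEMAND"},
}

merged = deep_merge(cluster_definition, recommendation)
```

With the shallow loop, `aws_attributes` would be replaced wholesale and `zone_id` would be lost; with the deep merge, `zone_id` from the cluster definition survives and only `availability` is overwritten.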

taylorgaw · 2024-03-11T22:13:31Z

Makefile

 .PHONY: test
 test:
-	pytest
+	pytest -vv


This is consistent across our other repos for make test

taylorgaw force-pushed the PROD-1686/get_diff branch from 88e815c to 9a6a51b Compare March 6, 2024 21:41

taylorgaw marked this pull request as ready for review March 6, 2024 21:41

taylorgaw changed the title ~~WIP: get diff between latest recommendation and cluster schema~~ feat: get diff between latest recommendation and cluster schema [PROD-1686] Mar 6, 2024

taylorgaw requested review from brandon-kaplan, kartiknagappa, romainissynced and singhals March 6, 2024 21:42

taylorgaw force-pushed the PROD-1686/get_diff branch from 9a6a51b to c0510ce Compare March 6, 2024 21:43

feat: Compare cluster configs in Sync Library

138add5

taylorgaw force-pushed the PROD-1686/get_diff branch from c0510ce to 138add5 Compare March 6, 2024 21:52

romainissynced reviewed Mar 6, 2024

sync/clients/sync.py (outdated, resolved)

romainissynced reviewed Mar 6, 2024

sync/clients/sync.py (outdated, resolved)

romainissynced reviewed Mar 6, 2024

sync/clients/sync.py (outdated, resolved)

singhals requested changes Mar 6, 2024

sync/clients/sync.py (outdated, resolved)

sync/clients/sync.py (outdated, resolved)

syncbrian reviewed Mar 7, 2024

sync/clients/sync.py (resolved)

eheinlein-sync and others added 2 commits March 8, 2024 11:57

feat: Compare cluster configs in Sync Library

45a92d2

moving methods to projects.py finalizing changes for tests

Moving methods to projects.py and adding azure support

112af2d

taylorgaw requested review from romainissynced, singhals, syncbrian and taylorgaw March 8, 2024 16:59

taylorgaw added 6 commits March 8, 2024 12:04

linting

38c3b18

Removing redundant code left over from merge

d91508e

feat: Compare cluster configs in Sync Library

59ee99b

Removing redundant code left over from merge

9fad4db

Merge branch 'PROD-1686/get_diff' of https://github.com/synccomputing…

94883a3

…code/syncsparkpy into PROD-1686/get_diff

removing old files

0ac3d6b

singhals requested changes Mar 11, 2024

moving deep_merge to separate utility function

76ffeb6

taylorgaw reviewed Mar 11, 2024

taylorgaw requested a review from singhals March 11, 2024 22:14

singhals approved these changes Mar 11, 2024

taylorgaw merged commit 12ad132 into main Mar 11, 2024

6 participants