This repository has been archived by the owner on Oct 23, 2023. It is now read-only.

Add Databricks config to Spark config #351

Merged
merged 8 commits into master from databricks on Dec 17, 2022
Conversation

pingsutw
Member

@pingsutw pingsutw commented Dec 7, 2022

TL;DR

This config is used by the Databricks plugin in Propeller to submit the Databricks job request.
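As a rough illustration of what this config carries (the keys below are hypothetical, not taken from this PR): a Databricks job request is a nested structure, so it cannot live in a flat `map<string, string>` like `sparkConf`. The initial version of this change stored it as a base64-encoded JSON string, which can be sketched as:

```python
import base64
import json

# Hypothetical nested Databricks job request; the keys are illustrative only,
# not the actual schema used by the plugin.
databricks_conf = {
    "run_name": "flyte-task",
    "new_cluster": {"spark_version": "11.0.x-scala2.12", "num_workers": 2},
}

# Serialize the nested dict to JSON, then base64-encode it so it fits in a
# single string proto field.
encoded = base64.b64encode(json.dumps(databricks_conf).encode("utf-8")).decode("ascii")

# The plugin side would reverse the process to recover the original config.
decoded = json.loads(base64.b64decode(encoded))
assert decoded == databricks_conf
```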

Type

  • Bug Fix
  • Feature
  • Plugin

Are all requirements met?

  • Code completed
  • Smoke tested
  • Unit tests added
  • Code documentation added
  • Any pending items have an associated Issue

Complete description

^^^

Tracking Issue

flyteorg/flyte#3173

Signed-off-by: Kevin Su <pingsutw@apache.org>
@codecov

codecov bot commented Dec 7, 2022

Codecov Report

Merging #351 (4257652) into master (305c51e) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master     #351   +/-   ##
=======================================
  Coverage   73.12%   73.12%           
=======================================
  Files          18       18           
  Lines        1362     1362           
=======================================
  Hits          996      996           
  Misses        315      315           
  Partials       51       51           
Flag        Coverage Δ
unittests   73.12% <ø> (ø)

Flags with carried forward coverage won't be shown.


@@ -21,4 +21,8 @@ message SparkJob {
map<string, string> sparkConf = 4;
map<string, string> hadoopConf = 5;
string executorPath = 6; // Executor path for Python jobs.
// databricksConf is base64 encoded string which stores databricks job configuration.
Contributor
Why is the databricks one a b64-encoded string while the spark and hadoop confs above are just a map of str, str? Is the databricks one a binary object that needs to be encoded?

Member Author

Because it's a nested dict. Wait, I think I can use Struct here. Let me update it.


+1
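The resolution above (switching from a base64-encoded string to a protobuf `Struct`) can be sketched roughly as follows. This is an assumption about usage, not code from the PR; `google.protobuf.Struct` is the well-known type that holds arbitrarily nested JSON-like data, which is why it fits a nested Databricks config better than a flat string field.

```python
from google.protobuf.struct_pb2 import Struct
from google.protobuf.json_format import MessageToDict

# Hypothetical nested Databricks config; keys are illustrative only.
databricks_conf = {
    "run_name": "flyte-task",
    "new_cluster": {
        "spark_version": "11.0.x-scala2.12",
        "num_workers": "2",
    },
}

# Struct accepts arbitrarily nested JSON-like dicts, so no base64 step is needed.
s = Struct()
s.update(databricks_conf)

# The plugin side can recover the plain dict directly from the message.
restored = MessageToDict(s)
assert restored["new_cluster"]["spark_version"] == "11.0.x-scala2.12"
```

Compared with the base64 approach, this keeps the config introspectable in the proto message itself rather than hiding it inside an opaque encoded string.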

wild-endeavor
wild-endeavor previously approved these changes Dec 9, 2022
Contributor

@wild-endeavor wild-endeavor left a comment


Approve, but maybe address the question in a comment?

Signed-off-by: Kevin Su <pingsutw@apache.org>
@pingsutw pingsutw merged commit fd208b7 into master Dec 17, 2022
@pingsutw pingsutw deleted the databricks branch December 17, 2022 02:46
eapolinario pushed a commit that referenced this pull request Sep 8, 2023
* databricks plugin
* update comment
* Use struct instead of string
* Add token
* nit
* add instance name
* add instance name

Signed-off-by: Kevin Su <pingsutw@apache.org>
4 participants