[SW-2303] Create Sparkling Py4j Getaway for Steam #2173

Merged
jakubhava merged 22 commits into master from jh/SW-2303 on Jun 29, 2020

Conversation

Contributor

@jakubhava jakubhava commented Jun 11, 2020

Steam is trying to get rid of Livy.

To achieve this, we can use the Py4j Gateway directly and allow the Spark Python client to connect to a remote Spark cluster without the complexity of Livy.

@jakubhava jakubhava added next major release Goes into Major release work in progress WIP labels Jun 11, 2020
@jakubhava jakubhava force-pushed the jh/SW-2303 branch 5 times, most recently from 68c0901 to 18b0e7c Compare June 24, 2020 09:37
@jakubhava jakubhava changed the title [SW-2303] Create Sparkling Py4j Getaway [SW-2303] Create Sparkling Py4j Getaway for Steam Jun 24, 2020
@jakubhava jakubhava force-pushed the jh/SW-2303 branch 2 times, most recently from f6f70d6 to e462b8a Compare June 24, 2020 18:12
@jakubhava jakubhava requested a review from bilcus June 24, 2020 18:21
Member

@bilcus bilcus left a comment

LGTM, thanks! Will try the nightly once it is out.

@jakubhava jakubhava added next fix release and removed next major release Goes into Major release labels Jun 25, 2020
Collaborator

@mn-mikke mn-mikke left a comment

LGTM

@jakubhava
Contributor Author

@bilcus I have implemented the automatic port selection. Steam can then obtain the port by reading the following endpoint:

FLOW_PROXY_URL/3/option?name=spark.ext.h2o.py4j.gateway.port

I wanted to keep it generic so that the rest of our code doesn't contain references to the gateway code.
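
For illustration, a client could read that endpoint roughly as follows. This is only a minimal sketch using the Python standard library: the helper names `option_url` and `fetch_option` are hypothetical, the example host in the usage note is made up, and because the response format is not described in this thread, the body is returned as-is rather than parsed.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Option name as given in the comment above.
GATEWAY_PORT_OPTION = "spark.ext.h2o.py4j.gateway.port"


def option_url(flow_proxy_url: str, name: str = GATEWAY_PORT_OPTION) -> str:
    """Build FLOW_PROXY_URL/3/option?name=<name> (hypothetical helper)."""
    base = flow_proxy_url.rstrip("/")  # avoid a double slash before /3/option
    return f"{base}/3/option?{urlencode({'name': name})}"


def fetch_option(flow_proxy_url: str, timeout: float = 10.0) -> str:
    """Return the raw response body of the option endpoint.

    The response format is an unknown here, so no parsing is attempted.
    """
    with urlopen(option_url(flow_proxy_url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

With a hypothetical proxy address, `fetch_option("http://flow-proxy.example:54321")` would issue the GET request; the caller would then extract the port from whatever the endpoint actually returns.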

@bilcus
Member

bilcus commented Jun 26, 2020

@jakubhava
Nice, I like it, thanks!

@jakubhava jakubhava merged commit c840139 into master Jun 29, 2020
@jakubhava jakubhava deleted the jh/SW-2303 branch June 29, 2020 05:09
jakubhava added a commit that referenced this pull request Jun 29, 2020
3 participants