Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Databricks connection should be using SparkSession not HiveContext #1921

Merged
merged 1 commit into from Feb 20, 2019

Conversation

MrBago
Copy link
Contributor

@MrBago MrBago commented Feb 20, 2019

It seems that sparklyr is expect sc$state$hive_context to be a SparkSession for spark 2.x. This PR updates the "hive_context" to use the same SparkSession as the notebook or rstudio environment.

This PR brings the databricks connection in line with what's done for spark_shell_connection, https://github.com/rstudio/sparklyr/blob/e05190e953d313483660af6a2c6a16f4ae50fb86/R/shell_connection.R#L595-L608

@falaki
Copy link
Collaborator

falaki commented Feb 20, 2019

@javierluraschi we tested this on Databricks and seems to resolve the issue our customers observed when using ml_load() in multiple notebooks.

@javierluraschi javierluraschi merged commit 90466eb into sparklyr:master Feb 20, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants