Added ability to return pandas dataframes from SparkSQL for Python and Scala #20
Pandas df is now returned for sql pyspark
It did not display before because magic was not returning it.
1. Create pandas df for scala and pyspark with one call only
2. Create sqlContext from session start
3. If there are no rows, no schema is shown :( We could detect that and default to the previous two-call method.
Need to fix unit tests. Going to sleep right now
Need to add unit tests for new functionality
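For reference, the empty-result case mentioned above (no rows means no schema) could be handled roughly like this. This is only a sketch, not the code in this PR: it assumes the single call returns rows as one JSON object per line, and `result_columns` and `fetch_schema` are hypothetical names standing in for whatever the magic actually does for the second call.

```python
import json


def records_from_json_lines(lines):
    """Parse one JSON object per non-empty line into a list of dicts."""
    return [json.loads(line) for line in lines if line.strip()]


def columns_of(records):
    """Collect column names in first-seen order; empty when there are no rows."""
    columns = []
    for record in records:
        for key in record:
            if key not in columns:
                columns.append(key)
    return columns


def result_columns(json_lines, fetch_schema):
    """Return (records, columns) for a query result.

    fetch_schema is a hypothetical callable that performs the extra
    schema request; it is only invoked when the rows carry no columns.
    """
    records = records_from_json_lines(json_lines)
    columns = columns_of(records)
    if not columns:
        # No rows came back, so the column names cannot be inferred
        # from the data; fall back to the separate schema call.
        columns = fetch_schema()
    return records, columns
```

With rows present, the column names come straight from the data and the schema call is skipped; only an empty result pays for the second round trip. The records and columns could then be handed to something like `pandas.DataFrame.from_records(records, columns=columns)`.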
.idea/sparkmagic.iml
.idea/workspace.xml
Minor nitpick: these can all be combined to .idea/*
Thanks!
Execution always returns result. Combined .gitignore entries.
With this last commit, I think we are good for now, unless you have more comments on the previous points. Please take a look @alope107. Thank you!
I have the mentioned minor nitpicks, but I don't think they need to be included in this PR. Looks good to me.
Added ability to return pandas dataframes from SparkSQL for Python and Scala
With unit tests. Also, made unit tests run faster.