Added ability to return pandas dataframes from SparkSQL for Python and Scala #20

aggFTW · 2015-10-01T02:34:24Z

With unit tests. Also, made unit tests run faster.

…peline2

Pandas df is now returned for sql pyspark

It did not display before because magic was not returning it.

1. Create pandas df for scala and pyspark with one call only
2. Create sqlContext from session start
3. If there are no rows, no schema is shown :( We could detect that and default to the previous two-call method.

Need to fix unit tests. Going to sleep right now
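The one-call idea and the empty-result fallback described above can be sketched roughly as follows. This is only an illustration, assuming the remote session hands back one JSON object per row as text; the function name `records_to_dataframe` and the `fallback_columns` parameter are hypothetical and are not taken from this PR's code.

```python
import json

import pandas as pd


def records_to_dataframe(json_lines, fallback_columns=None):
    """Build a pandas DataFrame from newline-separated JSON records.

    json_lines: text with one JSON object per line (assumed output format).
    fallback_columns: optional column names to use when no rows come back
        (hypothetical parameter, e.g. filled by a second schema-only call).
    """
    records = [
        json.loads(line)
        for line in json_lines.splitlines()
        if line.strip()
    ]
    if records:
        # Single call returned rows: column names come from the records.
        return pd.DataFrame.from_records(records)
    # No rows, so the records carry no schema information.
    # Fall back to separately supplied column names, if any.
    return pd.DataFrame(columns=list(fallback_columns or []))


# Example usage with made-up data.
output = '{"id": 1, "name": "a"}\n{"id": 2, "name": "b"}'
df = records_to_dataframe(output)
empty = records_to_dataframe("", fallback_columns=["id", "name"])
```

With rows present, the column names are inferred from the records themselves. With no rows, the caller has to supply the columns from somewhere else, which is where a second, schema-only call could come in.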

Need to add unit tests for new functionality

aggFTW · 2015-10-01T06:30:03Z

Closes #9. Closes #15.

alope107 · 2015-10-01T22:01:19Z

.gitignore

+
+.idea/sparkmagic.iml
+
+.idea/workspace.xml


Minor nitpick: these can all be combined to .idea/*

Execution always returns result. Combined .gitignore entries.

aggFTW · 2015-10-01T22:56:55Z

With this last commit, I think we are good for now, unless you have more comments on the previous points. Please take a look @alope107. Thank you!

alope107 · 2015-10-02T20:28:53Z

I have the mentioned minor nitpicks, but I don't think they need to be included in this PR. Looks good to me.

Alejandro Guerrero Gonzalez added 9 commits September 29, 2015 19:08

Merge remote-tracking branch 'jupyter-incubator/master' into visualpi…

ff33007

…peline2

Pandas pyspark client

230d8e8

Pandas df is now returned for sql pyspark

Fix display of pandas dataframe

e912316

It did not display before because magic was not returning it.

Fixed existing unit tests

5e482d7

Need to add unit tests for new functionality

Added pandas Scala livy client

00ea0ec

Added ability to display at least columns when no records are shown

11ea27a

Removed LivyClient from client factory

baaa1c4

Fixed typo

a460e13

alope107 reviewed Oct 1, 2015

Addressed PR comments

6f2a454

Execution always returns result. Combined .gitignore entries.

aggFTW added a commit that referenced this pull request Oct 2, 2015

Merge pull request #20 from aggFTW/visualpipeline2

8afab25

Added ability to return pandas dataframes from SparkSQL for Python and Scala

aggFTW merged commit 8afab25 into jupyter-incubator:master Oct 2, 2015

aggFTW deleted the visualpipeline2 branch October 2, 2015 21:45
