[SPARK-9964] [PySpark] [SQL] PySpark DataFrameReader accept RDD of String for JSON #8444

Closed
wants to merge 6 commits into from

yanboliang
Contributor

PySpark DataFrameReader should accept an RDD of Strings (like the Scala version does) for JSON, rather than only taking a path.
If this PR is merged, it should be duplicated to cover the other input types (not just JSON).

SparkQA commented Aug 26, 2015

Test build #41583 has finished for PR 8444 at commit 127717a.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Aug 26, 2015

Test build #41589 has finished for PR 8444 at commit f160ec4.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

rxin commented Aug 26, 2015

I don't think we should rename the parameter, since that could break compatibility. How about just changing the description?
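To illustrate the compatibility concern, here is a minimal sketch in plain Python. The functions `json_v1`, `json_renamed`, and `call_with_keyword` are hypothetical stand-ins invented for this example, not the PySpark API; they only show how a caller that passes `path=` by keyword would fail if the parameter were renamed.

```python
# Hypothetical stand-ins, not the real PySpark API. They only illustrate
# why renaming a public parameter can break existing callers.

def json_v1(path, schema=None):
    """Original signature: the first parameter is called `path`."""
    return ("loaded", path)


def json_renamed(source, schema=None):
    """Same function after a hypothetical rename of `path` to `source`."""
    return ("loaded", source)


def call_with_keyword(func):
    """Call `func` the way an existing caller might: by keyword."""
    try:
        return func(path="people.json")
    except TypeError as exc:
        # The renamed function no longer accepts the `path` keyword.
        return "TypeError: " + str(exc)


print(call_with_keyword(json_v1))
print(call_with_keyword(json_renamed))
```

Keeping the name `path` and only widening what it accepts (and documenting that) leaves keyword callers untouched, which appears to be the point of the suggestion above.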

SparkQA commented Aug 26, 2015

Test build #41611 has finished for PR 8444 at commit 3842a6b.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

    elif isinstance(path, RDD):
        return self._df(self._jreader.json(path._jrdd))
    else:
        raise Exception("path can be only string or RDD")
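The fragment above dispatches on the type of `path`. A self-contained sketch of that pattern, without Spark, might look like the following. `FakeRDD` and `FakeReader` are hypothetical stand-ins invented for illustration, the "files" are an in-memory dict, and the sketch raises `TypeError` rather than the bare `Exception` used in the patch; none of this is the actual PySpark implementation.

```python
import json


class FakeRDD:
    """Hypothetical stand-in for an RDD whose elements are JSON strings."""

    def __init__(self, lines):
        self.lines = list(lines)


class FakeReader:
    """Hypothetical stand-in for DataFrameReader; no Spark is involved."""

    def __init__(self, files):
        # Maps a path string to a list of JSON lines (assumed in-memory store).
        self._files = files

    def json(self, path):
        # The parameter keeps the name `path` even though it may be an RDD.
        if isinstance(path, str):
            lines = self._files[path]
        elif isinstance(path, FakeRDD):
            lines = path.lines
        else:
            raise TypeError("path can be only string or RDD")
        # Parse one JSON object per line, as a rough analogue of rows.
        return [json.loads(line) for line in lines]


reader = FakeReader({"people.json": ['{"name": "Alice"}']})
print(reader.json("people.json"))
print(reader.json(FakeRDD(['{"n": 1}', '{"n": 2}'])))
```

Both calls go through the same `json` method, so existing callers that pass a path string see no change in behavior.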

+1

SparkQA commented Aug 27, 2015

Test build #41668 has finished for PR 8444 at commit b2d072d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

rxin commented Aug 27, 2015

Thanks - I've merged this in master.

@asfgit asfgit closed this in ce97834 Aug 27, 2015
@yanboliang yanboliang deleted the spark-9964 branch August 27, 2015 10:33