Skip to content

[SPARK-10496][SQL] Add DataFrame cumulative sum#12578

Closed
zhengruifeng wants to merge 1 commit intoapache:masterfrom
zhengruifeng:cum_sum
Closed

[SPARK-10496][SQL] Add DataFrame cumulative sum#12578
zhengruifeng wants to merge 1 commit intoapache:masterfrom
zhengruifeng:cum_sum

Conversation

@zhengruifeng
Copy link
Contributor

@zhengruifeng zhengruifeng commented Apr 21, 2016

What changes were proposed in this pull request?

Add DataFrame cumulative sum.
Given numeric columns, create new columns which are the cumulative sum.

How was this patch tested?

unit tests and manual tests

@rxin
Copy link
Contributor

rxin commented Apr 21, 2016

Thanks for contributing, but isn't this similar to just window function?

@SparkQA
Copy link

SparkQA commented Apr 21, 2016

Test build #56551 has finished for PR 12578 at commit 97af903.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@zhengruifeng
Copy link
Contributor Author

@rxin It is said in JIRA that window function is not efficient for a large number of rows in this problem. @jkbradley

@SparkQA
Copy link

SparkQA commented Oct 7, 2016

Test build #66499 has finished for PR 12578 at commit 97af903.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Copy link
Contributor

rxin commented Oct 8, 2016

I'm going to move the discussions over to JIRA.

@zhengruifeng
Copy link
Contributor Author

This PR is out of date, I think it's time to close it.

@zhengruifeng zhengruifeng deleted the cum_sum branch January 5, 2017 02:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants