Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-43381][CONNECT] Make 'transformStatCov' lazy #41055

Closed
wants to merge 1 commit into from

Conversation

zhengruifeng
Copy link
Contributor

@zhengruifeng zhengruifeng commented May 5, 2023

What changes were proposed in this pull request?

Make 'transformStatCov' lazy

Why are the changes needed?

Existing implementation eagerly compute the cov, and return a local relation.

Even through currently message StatCov is only used in 'Dataset.stat.cov' which returns a double value, it still should be a bit beneficial if we make 'transformStatCov' lazy, since the creation of local relation is skipped.

Does this PR introduce any user-facing change?

No

How was this patch tested?

existing UT

@zhengruifeng
Copy link
Contributor Author

cc @HyukjinKwon

@HyukjinKwon
Copy link
Member

Merged to master.

@zhengruifeng zhengruifeng deleted the connect_lazy_cov branch May 7, 2023 23:40
LuciferYang pushed a commit to LuciferYang/spark that referenced this pull request May 10, 2023
### What changes were proposed in this pull request?
Make 'transformStatCov' lazy

### Why are the changes needed?
Existing implementation eagerly compute the `cov`, and return a local relation.

Even through currently `message StatCov` is only used in 'Dataset.stat.cov' which returns a double value, it still should be a bit beneficial if we make 'transformStatCov' lazy, since the creation of local relation is skipped.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
existing UT

Closes apache#41055 from zhengruifeng/connect_lazy_cov.

Authored-by: Ruifeng Zheng <ruifengz@apache.org>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
2 participants