Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-48295][PS] Turn on compute.ops_on_diff_frames by default #46602

Closed

Conversation

zhengruifeng
Copy link
Contributor

What changes were proposed in this pull request?

Turn on compute.ops_on_diff_frames by default

Why are the changes needed?

1, in most cases, this config need to be turned on to enable computation with different dataframes;
2, enable compute.ops_on_diff_frames should not break any workloads, it should only enable more;

Does this PR introduce any user-facing change?

yes, this config is turned on by default

How was this patch tested?

updated tests

Was this patch authored or co-authored using generative AI tooling?

no

@zhengruifeng
Copy link
Contributor Author

will add it to release note soon

Copy link
Contributor

@itholic itholic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Otherwise LGTM when test passing

Comment on lines +494 to +495
>>> with ps.option_context("compute.ops_on_diff_frames", False):
... ps.DataFrame(data=sdf, index=pd.Index([0, 1, 2]))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think maybe now we can just remove the negative cases from the doctests and keep only UTs??

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not sure, @HyukjinKwon wdyt?

Comment on lines +513 to +514
>>> with ps.option_context("compute.ops_on_diff_frames", False):
... ps.DataFrame(data=sdf, index=ps.Index([0, 1, 2]))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ditto and all belows?

@HyukjinKwon
Copy link
Member

Merged to master.

@zhengruifeng zhengruifeng deleted the turn_on_ops_on_diff_frames branch May 16, 2024 03:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
3 participants