Skip to content

[SPARK-52307][PYTHON][CONNECT] Support Scalar Arrow Iterator UDF #51018

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 3 commits into from

Conversation

zhengruifeng
Copy link
Contributor

What changes were proposed in this pull request?

Support Scalar Arrow Iterator UDF

Why are the changes needed?

to make Arrow UDF suitable for expansive initialization within each partition

Does this PR introduce any user-facing change?

Not yet, the whole Arrow UDF is private for now

How was this patch tested?

New Tests

Was this patch authored or co-authored using generative AI tooling?

No

empty
@xinrong-meng
Copy link
Member

LGTM thank you!

@zhengruifeng
Copy link
Contributor Author

thanks, merged to master

@zhengruifeng zhengruifeng deleted the py_arrow_udf_iter branch June 3, 2025 01:25
yhuang-db pushed a commit to yhuang-db/spark that referenced this pull request Jun 9, 2025
### What changes were proposed in this pull request?
Support Scalar Arrow Iterator UDF

### Why are the changes needed?
to make Arrow UDF suitable for expansive initialization within each partition

### Does this PR introduce _any_ user-facing change?
Not yet, the whole Arrow UDF is private for now

### How was this patch tested?
New Tests

### Was this patch authored or co-authored using generative AI tooling?
No

Closes apache#51018 from zhengruifeng/py_arrow_udf_iter.

Authored-by: Ruifeng Zheng <ruifengz@apache.org>
Signed-off-by: Ruifeng Zheng <ruifengz@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants