Skip to content

[SPARK-56084][PYTHON] Remove unused ArrowStreamGroupUDFSerializer#55963

Closed
Yicong-Huang wants to merge 1 commit into
apache:masterfrom
Yicong-Huang:SPARK-56084/cleanup/grouped-arrow-udf-serializer
Closed

[SPARK-56084][PYTHON] Remove unused ArrowStreamGroupUDFSerializer#55963
Yicong-Huang wants to merge 1 commit into
apache:masterfrom
Yicong-Huang:SPARK-56084/cleanup/grouped-arrow-udf-serializer

Conversation

@Yicong-Huang
Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

Delete ArrowStreamGroupUDFSerializer from python/pyspark/sql/pandas/serializers.py.

Why are the changes needed?

ArrowStreamGroupUDFSerializer is no longer used after SPARK-55608 refactored SQL_GROUPED_MAP_ARROW_UDF / SQL_GROUPED_MAP_ARROW_ITER_UDF to use ArrowStreamGroupSerializer directly, and SPARK-56860 removed its only subclass CogroupArrowUDFSerializer. This class can be safely deleted.

Part of SPARK-55384.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Existing tests: pyspark.sql.tests.arrow.test_arrow_grouped_map.

Was this patch authored or co-authored using generative AI tooling?

No.

zhengruifeng pushed a commit that referenced this pull request May 19, 2026
### What changes were proposed in this pull request?

Delete `ArrowStreamGroupUDFSerializer` from `python/pyspark/sql/pandas/serializers.py`.

### Why are the changes needed?

`ArrowStreamGroupUDFSerializer` is no longer used after SPARK-55608 refactored `SQL_GROUPED_MAP_ARROW_UDF` / `SQL_GROUPED_MAP_ARROW_ITER_UDF` to use `ArrowStreamGroupSerializer` directly, and SPARK-56860 removed its only subclass `CogroupArrowUDFSerializer`. This class can be safely deleted.

Part of SPARK-55384.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Existing tests: `pyspark.sql.tests.arrow.test_arrow_grouped_map`.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #55963 from Yicong-Huang/SPARK-56084/cleanup/grouped-arrow-udf-serializer.

Authored-by: Yicong Huang <17627829+Yicong-Huang@users.noreply.github.com>
Signed-off-by: Ruifeng Zheng <ruifengz@foxmail.com>
(cherry picked from commit 164dbfb)
Signed-off-by: Ruifeng Zheng <ruifengz@foxmail.com>
@zhengruifeng
Copy link
Copy Markdown
Contributor

merged to master/4.x

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants