[SPARK-56607][PYTHON][FOLLOWUP] Use pyspark.sql.DataFrame to support connect-only by gaogaotiantian · Pull Request #55630 · apache/spark

gaogaotiantian · 2026-04-30T18:04:44Z

What changes were proposed in this pull request?

Use pyspark.sql.DataFrame, not the classic one, in mlutils.py.

Why are the changes needed?

We have a connect-only CI job in which the classic DataFrame class is not even available. This util should work with the Connect DataFrame too.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

test_pipeline and test_parity_pipeline passed locally.

Was this patch authored or co-authored using generative AI tooling?

No.

gaogaotiantian · 2026-04-30T18:05:36Z

@HyukjinKwon you were right in #55526 (comment) - I should not use classic.DataFrame directly because this util is also used by connect. We should still overwrite __new__ like others did.

https://github.com/apache/spark/actions/runs/25129704754
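The point about overriding `__new__` can be illustrated with a toy sketch. This is not PySpark code: `DataFrame`, `MockDataset`, and `CLASSIC_AVAILABLE` below are hypothetical stand-ins, and the assumption that the parent's `__new__` depends on a backend is made only for illustration. The sketch shows the general pattern of a mock that subclasses a public parent type while bypassing the parent's construction logic.

```python
# Toy model only: these names are hypothetical and do not come from PySpark.

CLASSIC_AVAILABLE = False  # assumption: simulate a connect-only environment


class DataFrame:
    """Stand-in for a public parent class whose __new__ needs a backend."""

    def __new__(cls, *args, **kwargs):
        # Assumed behavior: constructing through the parent requires the
        # classic backend, which is missing in this simulated environment.
        if not CLASSIC_AVAILABLE:
            raise ImportError("classic backend is not installed")
        return super().__new__(cls)


class MockDataset(DataFrame):
    """Mock that is still an instance of the public parent type."""

    def __new__(cls):
        # Skip the parent's __new__ so no backend is required.
        return object.__new__(cls)

    def __init__(self):
        self.index = 0


ds = MockDataset()
print(isinstance(ds, DataFrame))  # the mock passes checks against the parent

try:
    DataFrame()
except ImportError as exc:
    print(f"direct construction failed: {exc}")
```

Because the mock inherits from the public parent rather than from a backend-specific class, `isinstance` checks against the parent keep working regardless of which backend is installed.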

HyukjinKwon · 2026-05-01T23:51:30Z

Merged to master.

Use pyspark.sql.DataFrame to support connect-only

34cd640

HyukjinKwon approved these changes May 1, 2026

HyukjinKwon closed this in 2df302d May 1, 2026

gaogaotiantian wants to merge 1 commit into apache:master from gaogaotiantian:fix-mlutils
