[SPARK-47500][PYTHON][CONNECT] Factor column name handling out of plan.py #45636

Closed

@zhengruifeng zhengruifeng commented Mar 21, 2024

What changes were proposed in this pull request?

Factor column name handling out of plan.py

Why are the changes needed?

plan.py does too much parameter preprocessing, e.g. column name handling,
and multiple duplicated helper functions are scattered across the module, which sometimes makes the code hard to follow.
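
To illustrate the kind of refactor described here, the following is a minimal sketch, not the actual PySpark code. The helper `normalize_column_names` and the `Drop` and `Sort` classes are hypothetical stand-ins: the idea is that each plan node calls one shared helper instead of carrying its own copy of the column name preprocessing.

```python
from typing import List, Sequence, Union

# Hypothetical alias: a column argument is a name or a list/tuple of names.
ColumnNames = Union[str, Sequence[str]]


def normalize_column_names(*cols: ColumnNames) -> List[str]:
    """Flatten str / list / tuple arguments into one flat list of names."""
    names: List[str] = []
    for col in cols:
        if isinstance(col, str):
            names.append(col)
        elif isinstance(col, (list, tuple)):
            for item in col:
                if not isinstance(item, str):
                    raise TypeError(
                        f"column name should be a str, got {type(item).__name__}"
                    )
                names.append(item)
        else:
            raise TypeError(
                f"column should be a str, list or tuple, got {type(col).__name__}"
            )
    return names


class Drop:
    """Hypothetical plan node that delegates preprocessing to the helper."""

    def __init__(self, *cols: ColumnNames) -> None:
        self.columns = normalize_column_names(*cols)


class Sort:
    """A second hypothetical node reusing the same helper, with no duplicated logic."""

    def __init__(self, *cols: ColumnNames) -> None:
        self.columns = normalize_column_names(*cols)
```

With the handling in one place, a change to validation or flattening rules only needs to be made once, and every node that takes column names picks it up.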

Does this PR introduce any user-facing change?

no, just code refactor

How was this patch tested?

ci

Was this patch authored or co-authored using generative AI tooling?

no

init

init

init

init

init

nit

nit

nit

reuse
@HyukjinKwon

Merged to master.

@zhengruifeng zhengruifeng deleted the plan_clean_up branch March 22, 2024 06:10
sweisdb pushed a commit to sweisdb/spark that referenced this pull request Apr 1, 2024
…an.py`

### What changes were proposed in this pull request?
Factor column name handling out of `plan.py`

### Why are the changes needed?
`plan.py` does too much parameter preprocessing, e.g. column name handling,
and multiple duplicated helper functions are scattered across the module, which sometimes makes the code hard to follow.

### Does this PR introduce _any_ user-facing change?
no, just code refactor

### How was this patch tested?
ci

### Was this patch authored or co-authored using generative AI tooling?
no

Closes apache#45636 from zhengruifeng/plan_clean_up.

Authored-by: Ruifeng Zheng <ruifengz@apache.org>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
HyukjinKwon pushed a commit that referenced this pull request Jun 11, 2024
…ataFrame.select(None)`

### What changes were proposed in this pull request?
The refactor PR #45636 changed the error raised by `DataFrame.select(None)` from `PySparkTypeError` to `AssertionError`. This PR restores the previous error message.
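
As a rough illustration of the difference, and not the actual PySpark implementation, the sketch below contrasts validating an argument with `assert` against raising an explicit type error. Both function names are hypothetical, and the built-in `TypeError` stands in for `PySparkTypeError`.

```python
def select_with_assert(*cols):
    # Hypothetical variant: invalid input surfaces as AssertionError.
    # Note that assert statements are skipped entirely under `python -O`.
    for c in cols:
        assert isinstance(c, str), "column should be a str"
    return list(cols)


def select_with_type_check(*cols):
    # Hypothetical variant: invalid input surfaces as a descriptive type
    # error (built-in TypeError used here as a stand-in).
    for c in cols:
        if not isinstance(c, str):
            raise TypeError(
                f"Argument `cols` should be a str, got {type(c).__name__}."
            )
    return list(cols)
```

Passing `None` to the first variant produces an `AssertionError`, which reads like an internal bug, while the second reports the offending argument and its type to the caller.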

### Why are the changes needed?
error message improvement

### Does this PR introduce _any_ user-facing change?
yes, error message improvement

### How was this patch tested?
added test

### Was this patch authored or co-authored using generative AI tooling?
no

Closes #46930 from zhengruifeng/py_restore_select_error.

Authored-by: Ruifeng Zheng <ruifengz@apache.org>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>