[SPARK-36537][PYTHON] Revisit disabled tests for CategoricalDtype #33817
…place updates with CategoricalDtype.
Test build #142716 has finished for PR 33817 at commit
Kubernetes integration test starting
Kubernetes integration test status failure
Test build #142745 has finished for PR 33817 at commit
Kubernetes integration test starting
Kubernetes integration test status success
Thanks for the correction, @ueshin
Test build #142777 has finished for PR 33817 at commit
LGTM, pending tests.
Test build #142779 has finished for PR 33817 at commit
Kubernetes integration test starting
Kubernetes integration test status failure
Kubernetes integration test starting
Kubernetes integration test status success
Merged to master.
This PR proposes to enable the tests that were disabled because of behavior differences with pandas 1.3.

- The `inplace` argument for `CategoricalDtype` functions is deprecated as of pandas 1.3 and seems to have a bug, so we manually created the expected results and test against them.
- Fixed `GroupBy.transform`, since it didn't work properly for `CategoricalDtype`.

We should enable the tests as much as possible even if pandas has a bug. And we should follow the behavior of the latest pandas.

Yes, `GroupBy.transform` now follows the behavior of the latest pandas.

Unittests.

Closes #33817 from itholic/SPARK-36537.

Authored-by: itholic <haejoon.lee@databricks.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit fe48618)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>

Merged to branch-3.2 too.
What changes were proposed in this pull request?
This PR proposes to enable the tests that were disabled because of behavior differences with pandas 1.3.
- The `inplace` argument for `CategoricalDtype` functions is deprecated as of pandas 1.3 and seems to have a bug, so we manually created the expected results and test against them.
- Fixed `GroupBy.transform`, since it didn't work properly for `CategoricalDtype`.
Why are the changes needed?
We should enable the tests as much as possible even if pandas has a bug.
And we should follow the behavior of the latest pandas.
Does this PR introduce any user-facing change?
Yes, `GroupBy.transform` now follows the behavior of the latest pandas.
How was this patch tested?
Unittests.
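
As a rough illustration of the "manually created expected result" approach mentioned in the description, the sketch below builds the expected categorical Series by hand with plain pandas instead of relying on an `inplace=True` call. The data, the variable names, and the choice of `add_categories` are assumptions made for illustration; they are not taken from the PR's test suite, and the real tests compare against pandas-on-Spark objects rather than a second pandas Series.

```python
import pandas as pd
from pandas.api.types import CategoricalDtype

# Hypothetical sample data, not from the PR.
values = ["a", "b", "c", "a"]
pser = pd.Series(values, dtype=CategoricalDtype(categories=["a", "b", "c"]))

# Build the expected result explicitly rather than trusting an
# `inplace=True` update: same values, with "x" added to the categories.
expected = pd.Series(
    values, dtype=CategoricalDtype(categories=["a", "b", "c", "x"])
)

# The non-inplace call returns a new Series with the extended categories.
actual = pser.cat.add_categories(["x"])

# Raises AssertionError if values or dtype (including categories) differ.
pd.testing.assert_series_equal(actual, expected)
```

Spelling out the expected dtype keeps the test independent of how a given pandas version handles in-place category updates.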