[FEA] Reorganize and improve Python tests #9999

vyasr · 2022-01-07T19:52:39Z

Note for developers
This is a meta-issue aiming to categorize a wide range of issues. Developers who want to tackle a specific item from the checklist below should create a new issue for just that item, self-assign that issue, and then link it to the checklist above.

Is your feature request related to a problem? Please describe.
There are currently a number of different problems that make it difficult to find, add, or run tests.

Tests are partially and inconsistently organized by functionality, data types, and parametrizations, so it's not clear which file tests of a specific function might be in. The partial organization by dtype is particularly confusing because it means that in some files we test a single function across many dtypes, whereas in other files we test many functions for a single dtype.
Test files are too large and contain too many tests.
There are currently many tests that raise warnings as well as many xfailed tests that actually xpass.
Tests are currently slow to run because we rely on excessive parametrization and we test many private APIs. Additionally, cuIO tests are especially slow because the corresponding libcudf APIs are difficult to test, so a greater burden is placed on the Python APIs to capture a wider range of issues.

Describe the solution you'd like
These are tasks that we propose to undertake to address the various issues discussed above:

The text was updated successfully, but these errors were encountered:

bdice · 2022-01-07T20:20:15Z

I have started looking into xfail_strict=true in PR #9998. That will require some significant work to analyze all the xfail tests that pass (1956 tests failed with xfail_strict=true, and most of those are xpassed strict failures). I am going to prioritize solving locally failing tests like #7314 and reduce the number of warnings in the pytest log first, before going back to #9998.

github-actions · 2022-02-06T21:02:39Z

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

This PR reduces the overall runtime of the cuDF pytest suite. Changes include: - asserting equal on the GPU where possible for large datasets - in some cases reducing excessive test data size part of #9999 Authors: - https://github.com/brandon-b-miller Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Ashwin Srinath (https://github.com/shwina) - Bradley Dice (https://github.com/bdice) URL: #10203

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Ashwin Srinath (https://github.com/shwina) URL: #12293

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Michael Wang (https://github.com/isVoid) URL: #12305

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - https://github.com/brandon-b-miller - Matthew Roeschke (https://github.com/mroeschke) URL: #12304

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Ashwin Srinath (https://github.com/shwina) URL: #12310

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Lawrence Mitchell (https://github.com/wence-) URL: #12326

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Bradley Dice (https://github.com/bdice) URL: #12313

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Bradley Dice (https://github.com/bdice) URL: #12324

One note with these deprecations. pandas has a special value used for parameters that are defaulted, and as a result explicitly passing None will throw a warning. We don't do this, hence the difference in the warnings contexts I use for our calls vs pandas (`pytest.warns` vs `expect_warning_if`). I didn't feel like this was worth commenting on in every place, but I can if reviewers want. Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Bradley Dice (https://github.com/bdice) URL: #12334

I realized that my previous warning reduction PRs were causing some circular work where I would add a new warning to cudf to match pandas, which would cause those new warnings to appear in modules that I had previously declared free of warnings. To prevent this, I've changed my approach to instead go through the test modules in alphabetical order and ensure that they are all error free up to that point. This PR removes warnings from all test modules up to test_dataframe.py. Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Bradley Dice (https://github.com/bdice) URL: #12355

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Matthew Roeschke (https://github.com/mroeschke) URL: #12381

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Matthew Roeschke (https://github.com/mroeschke) - Ashwin Srinath (https://github.com/shwina) URL: #12369

Contributes to #9999 and #10363. When I merge these changes with #12369 I no longer see any warnings on my machine. I suspect that there will be slightly different results on different machines, so we'll see have to see how CI looks after both PRs are merged before we close #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Matthew Roeschke (https://github.com/mroeschke) - Bradley Dice (https://github.com/bdice) URL: #12406

vyasr added proposal Change current process or code tests Unit testing for project code quality cuDF (Python) Affects Python cuDF API. improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Jan 7, 2022

vyasr added this to the CuDF Python Refactoring milestone Jan 7, 2022

vyasr added this to Needs prioritizing in Other Issues via automation Jan 7, 2022

vyasr added this to Issue-Needs prioritizing in v22.04 Release via automation Jan 7, 2022

brandon-b-miller mentioned this issue Feb 3, 2022

Reduce pytest runtime #10203

Merged

vyasr mentioned this issue Feb 4, 2022

Refactor isin implementations #10165

Merged

github-actions bot added the inactive-30d label Feb 6, 2022

This was referenced Feb 25, 2022

[FEA] Remove FutureWarnings from Python tests #10363

Closed

UserWarning: A CUDA context for device 0 already exists rapidsai/dask-cuda#867

Closed

bdice mentioned this issue Mar 7, 2022

Enable xfail_strict=true in pytest. #9998

Closed

caryr35 removed this from Issue-Needs prioritizing in v22.04 Release Apr 12, 2022

caryr35 added this to Issue-Needs prioritizing in v22.06 Release via automation Apr 12, 2022

caryr35 removed this from Issue-Needs prioritizing in v22.06 Release Jun 16, 2022

caryr35 added this to Issue-Needs prioritizing in v22.08 Release via automation Jun 16, 2022

vyasr mentioned this issue Jul 12, 2022

Add dev docs for documentation writing #11217

Merged

vyasr mentioned this issue Jul 22, 2022

Add test of wildcard selection #11300

Merged

vyasr mentioned this issue Aug 4, 2022

Define a clear and enforceable standard file layout for cuDF Python #11474

Closed

caryr35 removed this from Issue-Needs prioritizing in v22.08 Release Aug 11, 2022

caryr35 added this to Issue-Needs prioritizing in v22.10 Release via automation Aug 11, 2022

caryr35 removed this from Issue-Needs prioritizing in v22.10 Release Oct 18, 2022

caryr35 added this to Issue-Needs prioritizing in v22.12 Release via automation Oct 18, 2022

shwina removed this from Issue-Needs prioritizing in v22.12 Release Oct 24, 2022

vyasr mentioned this issue Dec 2, 2022

Fix warnings in test_stats.py #12293

Merged

3 tasks

rapids-bot bot pushed a commit that referenced this issue Dec 5, 2022

Fix warnings in test_stats.py (#12293)

7432300

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Ashwin Srinath (https://github.com/shwina) URL: #12293

This was referenced Dec 5, 2022

Fix warnings in test_joining.py #12304

Merged

Fix warnings in test_indexing.py #12305

Merged

Fix warnings in test_multiindex.py #12310

Merged

rapids-bot bot pushed a commit that referenced this issue Dec 5, 2022

Fix warnings in test_indexing.py (#12305)

1ca4dad

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Michael Wang (https://github.com/isVoid) URL: #12305

vyasr mentioned this issue Dec 5, 2022

Fix warnings in test_index.py #12313

Merged

3 tasks

rapids-bot bot pushed a commit that referenced this issue Dec 6, 2022

Fix warnings in test_multiindex.py (#12310)

2d8ebc9

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Ashwin Srinath (https://github.com/shwina) URL: #12310

This was referenced Dec 6, 2022

Fix warnings in test_groupby.py #12324

Merged

Fix warnings in test_orc.py #12326

Merged

This was referenced Dec 7, 2022

Fix warnings in test_monotonic.py #12334

Merged

Fix warnings in test modules up to test_dataframe.py #12355

Merged

rapids-bot bot pushed a commit that referenced this issue Dec 9, 2022

Fix warnings in test_index.py (#12313)

98dc445

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Bradley Dice (https://github.com/bdice) URL: #12313

rapids-bot bot pushed a commit that referenced this issue Dec 10, 2022

Fix warnings in test_groupby.py (#12324)

e5f3544

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Bradley Dice (https://github.com/bdice) URL: #12324

This was referenced Dec 13, 2022

Fix warnings in dataframe.py #12369

Merged

Fix warnings in test_datetime.py #12381

Merged

rapids-bot bot pushed a commit that referenced this issue Dec 15, 2022

Fix warnings in test_datetime.py (#12381)

8f975d6

Contributes to #9999 and #10363. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Matthew Roeschke (https://github.com/mroeschke) URL: #12381

vyasr mentioned this issue Dec 17, 2022

Fix warnings in remaining modules #12406

Merged

3 tasks

vyasr removed code quality labels Feb 23, 2024

vyasr mentioned this issue May 17, 2024

[BUG] test_concat file instantiates GPU objects in the parametrize arguments #15651

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Reorganize and improve Python tests #9999

[FEA] Reorganize and improve Python tests #9999

vyasr commented Jan 7, 2022 •

edited

bdice commented Jan 7, 2022

github-actions bot commented Feb 6, 2022

[FEA] Reorganize and improve Python tests #9999

[FEA] Reorganize and improve Python tests #9999

Comments

vyasr commented Jan 7, 2022 • edited

bdice commented Jan 7, 2022

github-actions bot commented Feb 6, 2022

vyasr commented Jan 7, 2022 •

edited