Skip to content

[data] split dask and modin tests #54122

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jun 26, 2025
Merged

Conversation

aslonnie
Copy link
Collaborator

@aslonnie aslonnie commented Jun 26, 2025

as dask tests will require running on python 3.12+ after upgrade; modin does not

also adds dask tag, for running on a separate test job in the future.

as dask tests will require running on python 3.12+
after upgrade; modin does not

Signed-off-by: Lonnie Liu <lonnie@anyscale.com>
@Copilot Copilot AI review requested due to automatic review settings June 26, 2025 02:11
@aslonnie aslonnie requested a review from a team as a code owner June 26, 2025 02:11
@aslonnie aslonnie requested a review from richardliaw June 26, 2025 02:11
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR splits the tests for Modin and Dask to accommodate the Python 3.12+ requirements for Dask while keeping Modin tests unaffected.

  • Added a new test file (test_ecosystem_modin.py) with Modin-specific tests.
  • Removed the Modin tests from the Dask test file (test_ecosystem_dask.py).
  • Updated the BUILD file to define separate test targets for Modin and Dask.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File Description
python/ray/data/tests/test_ecosystem_modin.py Introduces dedicated tests for Modin functionality.
python/ray/data/tests/test_ecosystem_dask.py Removes Modin test cases, retaining only Dask-related tests.
python/ray/data/BUILD Updates test targets to reflect the split between Modin and Dask.

Comment on lines +11 to +16
def test_from_modin(ray_start_regular_shared):
import modin.pandas as mopd

df = pd.DataFrame(
{"one": list(range(100)), "two": list(range(100))},
)
Copy link
Preview

Copilot AI Jun 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] The DataFrame initialization logic is repeated in both test_from_modin and test_to_modin. Consider refactoring this into a fixture or helper function to improve code maintainability.

Suggested change
def test_from_modin(ray_start_regular_shared):
import modin.pandas as mopd
df = pd.DataFrame(
{"one": list(range(100)), "two": list(range(100))},
)
def _create_test_dataframe():
return pd.DataFrame(
{"one": list(range(100)), "two": list(range(100))},
)
def test_from_modin(ray_start_regular_shared):
import modin.pandas as mopd
df = _create_test_dataframe()

Copilot uses AI. Check for mistakes.

@aslonnie
Copy link
Collaborator Author

this is required for accepting #52589

@aslonnie aslonnie requested a review from robertnishihara June 26, 2025 02:12
@aslonnie aslonnie added the go add ONLY when ready to merge, run all tests label Jun 26, 2025
@aslonnie aslonnie assigned richardliaw and unassigned richardliaw Jun 26, 2025
@aslonnie aslonnie requested review from bveeramani and raulchen June 26, 2025 18:03
Copy link
Member

@bveeramani bveeramani left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Stamp

@aslonnie aslonnie merged commit e6e1e51 into master Jun 26, 2025
5 checks passed
@aslonnie aslonnie deleted the lonnie-250625-dasksplitdata branch June 26, 2025 23:32
minerharry pushed a commit to minerharry/ray that referenced this pull request Jun 27, 2025
as dask tests will require running on python 3.12+ after upgrade; modin
does not

also adds `dask` tag, for running on a separate test job in the future.

Signed-off-by: Lonnie Liu <lonnie@anyscale.com>
lk-chen pushed a commit to alexeykudinkin/ray that referenced this pull request Jun 29, 2025
as dask tests will require running on python 3.12+ after upgrade; modin
does not

also adds `dask` tag, for running on a separate test job in the future.

Signed-off-by: Lonnie Liu <lonnie@anyscale.com>
elliot-barn pushed a commit that referenced this pull request Jul 2, 2025
as dask tests will require running on python 3.12+ after upgrade; modin
does not

also adds `dask` tag, for running on a separate test job in the future.

Signed-off-by: Lonnie Liu <lonnie@anyscale.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
goutamvenkat-anyscale pushed a commit to goutamvenkat-anyscale/ray that referenced this pull request Jul 4, 2025
as dask tests will require running on python 3.12+ after upgrade; modin
does not

also adds `dask` tag, for running on a separate test job in the future.

Signed-off-by: Lonnie Liu <lonnie@anyscale.com>
Signed-off-by: Goutam V <goutam@anyscale.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
go add ONLY when ready to merge, run all tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants