Skip to content

[Data] Jail unstructured_data_ingestion release test#63236

Merged
goutamvenkat-anyscale merged 3 commits into
ray-project:masterfrom
goutamvenkat-anyscale:goutam/bump_up_timeout_unstructured
May 8, 2026
Merged

[Data] Jail unstructured_data_ingestion release test#63236
goutamvenkat-anyscale merged 3 commits into
ray-project:masterfrom
goutamvenkat-anyscale:goutam/bump_up_timeout_unstructured

Conversation

@goutamvenkat-anyscale
Copy link
Copy Markdown
Contributor

@goutamvenkat-anyscale goutamvenkat-anyscale commented May 8, 2026

Description

Move the unstructured_data_ingestion release test from weekly to manual while the poor throughput / CPU utilization issue is investigated.

The previous timeout bump could mask the underlying runtime behavior, so this change removes the test from scheduled release runs instead of extending its runtime budget.

Jailing test because the current CPU utilization is too low and the test needs to be rewritten.

Test plan

  • Verified release/release_tests.yaml parses and unstructured_data_ingestion resolves to frequency: manual.
  • Commit hooks passed for the staged change.

…test

Signed-off-by: Goutam <goutam@anyscale.com>
@goutamvenkat-anyscale goutamvenkat-anyscale added data Ray Data-related issues go add ONLY when ready to merge, run all tests labels May 8, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request increases the timeout for a release test in release/release_tests.yaml from 3600 to 5000 seconds. Feedback indicates that this increase might be masking a root cause issue, specifically an invalid pandas version dependency in an associated script that could be causing the test to fail or hang.

Comment thread release/release_tests.yaml Outdated

run:
timeout: 3600
timeout: 5000
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

Increasing the timeout to 5000 seconds might be masking an underlying issue. I noticed that the associated BYOD script release/ray_release/byod/byod_unstructured_data_ingestion.sh (line 11) attempts to install pandas==2.3.3, which is not a valid version (the latest stable version is 2.2.3). If the test is failing or hanging due to environment setup issues caused by this invalid dependency, increasing the timeout will not resolve the root cause. Please verify the dependencies in the BYOD script to ensure the test isn't failing for reasons other than execution time.

Move the unstructured data ingestion release test to manual while the throughput issue is investigated.

Signed-off-by: Goutam <goutam@anyscale.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
@goutamvenkat-anyscale goutamvenkat-anyscale changed the title [Data] - Bump up timeout for the unstructured_data_ingestion release test [Data] Jail unstructured_data_ingestion release test May 8, 2026
Restore the unstructured data ingestion release test timeout to 3600 seconds now that the test is manual.

Signed-off-by: Goutam <goutam@anyscale.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Copy link
Copy Markdown
Member

@bveeramani bveeramani left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you add a comment why we're disabling this?

@goutamvenkat-anyscale goutamvenkat-anyscale merged commit 4549a87 into ray-project:master May 8, 2026
6 checks passed
@goutamvenkat-anyscale goutamvenkat-anyscale deleted the goutam/bump_up_timeout_unstructured branch May 8, 2026 23:42
chillCode404 pushed a commit to chillCode404/ray-contrib that referenced this pull request May 9, 2026
## Description
Move the `unstructured_data_ingestion` release test from `weekly` to
`manual` while the poor throughput / CPU utilization issue is
investigated.

The previous timeout bump could mask the underlying runtime behavior, so
this change removes the test from scheduled release runs instead of
extending its runtime budget.


Jailing test because the current CPU utilization is too low and the test
needs to be rewritten.

## Test plan
- Verified `release/release_tests.yaml` parses and
`unstructured_data_ingestion` resolves to `frequency: manual`.
- Commit hooks passed for the staged change.

---------

Signed-off-by: Goutam <goutam@anyscale.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
dancingactor pushed a commit to dancingactor/ray that referenced this pull request May 13, 2026
## Description
Move the `unstructured_data_ingestion` release test from `weekly` to
`manual` while the poor throughput / CPU utilization issue is
investigated.

The previous timeout bump could mask the underlying runtime behavior, so
this change removes the test from scheduled release runs instead of
extending its runtime budget.


Jailing test because the current CPU utilization is too low and the test
needs to be rewritten.

## Test plan
- Verified `release/release_tests.yaml` parses and
`unstructured_data_ingestion` resolves to `frequency: manual`.
- Commit hooks passed for the staged change.

---------

Signed-off-by: Goutam <goutam@anyscale.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
am-kinetica pushed a commit to kineticadb/ray that referenced this pull request May 14, 2026
## Description
Move the `unstructured_data_ingestion` release test from `weekly` to
`manual` while the poor throughput / CPU utilization issue is
investigated.

The previous timeout bump could mask the underlying runtime behavior, so
this change removes the test from scheduled release runs instead of
extending its runtime budget.

Jailing test because the current CPU utilization is too low and the test
needs to be rewritten.

## Test plan
- Verified `release/release_tests.yaml` parses and
`unstructured_data_ingestion` resolves to `frequency: manual`.
- Commit hooks passed for the staged change.

---------

Signed-off-by: Goutam <goutam@anyscale.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Signed-off-by: anindyam1969 <amukherjee@kinetica.com>
Lucas61000 pushed a commit to Lucas61000/ray that referenced this pull request May 15, 2026
## Description
Move the `unstructured_data_ingestion` release test from `weekly` to
`manual` while the poor throughput / CPU utilization issue is
investigated.

The previous timeout bump could mask the underlying runtime behavior, so
this change removes the test from scheduled release runs instead of
extending its runtime budget.


Jailing test because the current CPU utilization is too low and the test
needs to be rewritten.

## Test plan
- Verified `release/release_tests.yaml` parses and
`unstructured_data_ingestion` resolves to `frequency: manual`.
- Commit hooks passed for the staged change.

---------

Signed-off-by: Goutam <goutam@anyscale.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

data Ray Data-related issues go add ONLY when ready to merge, run all tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants