[BUG] parquet_tests are skipped on Dataproc CI #10255

Closed
yinqingh opened this issue Jan 24, 2024 · 2 comments · Fixed by #10356
Labels
bug Something isn't working

Comments

@yinqingh
Collaborator

Describe the bug
The parquet_tests are skipped on Dataproc CI, as this warning from the CI log shows:

[2024-01-21T18:56:32.815Z] =============================== warnings summary ===============================
[2024-01-21T18:56:32.815Z] ../../src/main/python/parquet_testing_test.py:100
[2024-01-21T18:56:32.815Z]   /home/sa_116163337916449219958/integration_tests/src/main/python/parquet_testing_test.py:100: UserWarning: Skipping parquet-testing tests. 
                             Unable to locate data in any of: hdfs:/tmp/rapids_it/src/test/resources/parquet-testing/data/*.parquet, hdfs:/tmp/rapids_it/src/test/resources/parquet-testing/bad_data/*.parquet, /home/sa_116163337916449219958/thirdparty/parquet-testing/data/*.parquet, /home/sa_116163337916449219958/thirdparty/parquet-testing/bad_data/*.parquet
[2024-01-21T18:56:32.815Z]     warnings.warn("Skipping parquet-testing tests. Unable to locate data in any of: " + locations)

Expected behavior
The parquet tests should run on Dataproc CI.

Additional context
It seems the parquet test files are packaged under the HDFS path hdfs:/tmp/rapids_it/src/test/resources/parquet-testing, but the locate_parquet_testing_files function (https://github.com/NVIDIA/spark-rapids/blob/branch-24.02/integration_tests/src/main/python/parquet_testing_test.py#L91) only searches the local file system, so the test data cannot be found and the tests are skipped.
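
To illustrate the gap described above, here is a minimal sketch of a lookup that treats scheme-qualified patterns (such as `hdfs:/...`) separately from local ones. It is not the fix that landed in #10356, which this thread does not describe. The names `find_test_files` and `remote_glob` are hypothetical: `remote_glob` stands in for whatever remote listing the test harness would use (for example, something built on Hadoop's `FileSystem.globStatus`), and it is injected so the sketch runs without a cluster.

```python
import glob
import os
import tempfile
from urllib.parse import urlparse


def find_test_files(patterns, remote_glob=None):
    """Return the matches of the first pattern that matches anything.

    patterns: glob patterns. A pattern with a URI scheme (e.g. "hdfs:")
        is handed to remote_glob; a pattern without one is expanded on
        the local file system.
    remote_glob: hypothetical callable (pattern -> iterable of paths).
        When it is None, scheme-qualified patterns match nothing.
    """
    for pattern in patterns:
        if urlparse(pattern).scheme == "":
            # Plain path: local file system lookup.
            matches = sorted(glob.glob(pattern))
        elif remote_glob is not None:
            # Scheme-qualified path: delegate to the injected lister.
            matches = sorted(remote_glob(pattern))
        else:
            matches = []
        if matches:
            return matches
    return []


# Demo: the HDFS pattern is checked first, then the local fallback.
with tempfile.TemporaryDirectory() as tmp:
    open(os.path.join(tmp, "sample.parquet"), "w").close()
    candidates = ["hdfs:/tmp/example/*.parquet", os.path.join(tmp, "*.parquet")]
    # Without a remote lister, only the local file is found.
    print(find_test_files(candidates))
    # With a (fake) remote lister, the HDFS location wins.
    print(find_test_files(candidates, lambda p: ["hdfs:/tmp/example/a.parquet"]))
```

A lookup that only ever calls `glob.glob` would return nothing for the `hdfs:` pattern, which matches the skip warning in the log above.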

@yinqingh yinqingh added bug Something isn't working ? - Needs Triage Need team to review and classify labels Jan 24, 2024
@jlowe jlowe self-assigned this Jan 29, 2024
@mattahrens mattahrens removed the ? - Needs Triage Need team to review and classify label Jan 30, 2024
@jlowe jlowe assigned yinqingh and unassigned jlowe Feb 1, 2024
@mattahrens
Collaborator

@yinqingh can this issue be closed?

@yinqingh
Collaborator Author

yinqingh commented Apr 7, 2024

Sorry, I forgot to close this. This issue has been fixed.

@yinqingh yinqingh closed this as completed Apr 7, 2024
3 participants