Skip to content

[VL][CI] Change to use JDK-17 for Spark 3.3/3.4/3.5 tests#9209

Merged
PHILO-HE merged 7 commits intoapache:mainfrom
PHILO-HE:drop-jdk8-test
Apr 7, 2025
Merged

[VL][CI] Change to use JDK-17 for Spark 3.3/3.4/3.5 tests#9209
PHILO-HE merged 7 commits intoapache:mainfrom
PHILO-HE:drop-jdk8-test

Conversation

@PHILO-HE
Copy link
Member

@PHILO-HE PHILO-HE commented Apr 2, 2025

What changes were proposed in this pull request?

According to CI usage feedback, Spark 3.3/3.4/3.5 tests requires around 3 hours (see #9188). By upgrading to JDK-17, the test time can be reduced a lot.

We will continue the support for JDK-8. In CI, all Spark-3.2 UTs and some TPC tests are still verified on JDK-8, which should be sufficient to uncover code issues on JDK-8, if any.

(Fixes: #9188)

@github-actions github-actions bot added the INFRA label Apr 2, 2025
@github-actions
Copy link

github-actions bot commented Apr 2, 2025

Thanks for opening a pull request!

Could you open an issue for this pull request on Github Issues?

https://github.com/apache/incubator-gluten/issues

Then could you also rename commit message and pull request title in the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:

@zhouyuan
Copy link
Member

zhouyuan commented Apr 2, 2025

We need to document the JDK support policy in the index page.
related: #8574

$MVN_CMD clean test -Pspark-3.5 -Pbackends-velox -Piceberg -Pdelta -Phudi -Pspark-ut \
-DargLine="-Dspark.test.home=/opt/shims/spark35/spark_home/" \
-DtagsToExclude=org.apache.spark.tags.ExtendedSQLTest,org.apache.gluten.tags.UDFTest,org.apache.gluten.tags.SkipTestTags
-DargLine="-Dspark.test.home=$SPARK_HOME ${EXTRA_FLAGS}"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

spark-test-spark35:
needs: build-native-lib-centos-7
runs-on: ubuntu-20.04
container: apache/gluten:centos-8-jdk8
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should change with jdk17 image for this job also

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

JDK-17 is installed and set to use for each JDK-17 jobs.
I note arrow jars generated by build-native-lib job are actually compiled by JDK-8 and they are reused by other JDK-17 jobs. It works well. So currently we don't need to use JDK-17 docker inside which arrow jars are compiled by JDK-17.

Maybe, we can just keep one docker where both JDK-11 & 17 installed and switch the JDK to the required one for different CI jobs. For now, we can keep the two dockers and observe the CI for some time.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks, I see the point now, it seems for now we need to use a combination of JDK8(arrow jar) + JDK17 spark jars which are little odd. will try to check this on #9218

@github-actions github-actions bot added the CORE works for Gluten Core label Apr 6, 2025
@github-actions
Copy link

github-actions bot commented Apr 6, 2025

Run Gluten Clickhouse CI on x86

@github-actions
Copy link

github-actions bot commented Apr 6, 2025

Run Gluten Clickhouse CI on x86

1 similar comment
@github-actions
Copy link

github-actions bot commented Apr 6, 2025

Run Gluten Clickhouse CI on x86

@github-actions
Copy link

github-actions bot commented Apr 6, 2025

Run Gluten Clickhouse CI on x86

@PHILO-HE
Copy link
Member Author

PHILO-HE commented Apr 7, 2025

We need to document the JDK support policy in the index page. related: #8574

@zhouyuan, just documented it in NewToGluten.md.

Copy link
Member

@zhouyuan zhouyuan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@PHILO-HE PHILO-HE merged commit a5ad4a4 into apache:main Apr 7, 2025
47 checks passed
PHILO-HE added a commit to weiting-chen/gluten that referenced this pull request Apr 28, 2025
PHILO-HE added a commit that referenced this pull request Apr 29, 2025
* [GLUTEN-9199][VL] Fix error when creating shuffle file: open with O_CREAT or O_TMPFILE in second argument needs 3 arguments (#9200)

* [DOC] Fix broken links in documents (#9320)

* [GLUTEN-9369][DOC] Fix commands in the Gluten C++ debugging developer doc (#9368)

* [VL][CI] Change to use JDK-17 for Spark 3.3/3.4/3.5 tests (#9209)

* [VL][CI] Bump to use ubuntu-22.04 runner (#9262)

---------

Co-authored-by: Hongze Zhang <hongze.zzz123@gmail.com>
Co-authored-by: PHILO-HE <philo@apache.org>
Co-authored-by: Dina Suehiro Jones <dina.s.jones@intel.com>
Co-authored-by: Yuan <yuanzhou@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CORE works for Gluten Core DOCS INFRA

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[VL][CI] Spark-3.5 test needs 3.5 hours with Java-8

2 participants