[Spark] Disable S3A change detection for SeaweedFS compatibility #279

Merged
alxtkr77 merged 1 commit into mlrun:development from alxtkr77:CEML-669
Mar 19, 2026
@alxtkr77
Member

Summary

  • Spark jobs fail with "NoVersionAttributeException: Change detection policy requires ETag" when reading from SeaweedFS
  • SeaweedFS does not return ETags in the format expected by Hadoop's S3A change detection policy
  • Fix: disable ETag-based change detection by adding the line "spark.hadoop.fs.s3a.change.detection.mode none" to spark-defaults.conf in the CE Spark Docker image
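
The exact Dockerfile line added by this PR is not shown in this excerpt. As a rough sketch of what such a step could look like, the snippet below appends the property to a spark-defaults.conf file only if it is not already present. The temporary directory and the function name set_change_detection_none are hypothetical stand-ins, not names taken from the PR.

```shell
#!/bin/sh
# Hypothetical stand-in for the image's Spark conf directory (an assumption,
# not the path used in the PR).
CONF_DIR="$(mktemp -d)"
CONF_FILE="$CONF_DIR/spark-defaults.conf"
PROP="spark.hadoop.fs.s3a.change.detection.mode"

# Append "<property> none" unless a line for the property already exists,
# so repeated runs do not duplicate the entry.
set_change_detection_none() {
    touch "$1"
    if ! grep -q "^$PROP[[:space:]]" "$1"; then
        printf '%s none\n' "$PROP" >> "$1"
    fi
}

set_change_detection_none "$CONF_FILE"
set_change_detection_none "$CONF_FILE"   # second call leaves the file unchanged
cat "$CONF_FILE"
```

In a Dockerfile the same idea would usually collapse to a single RUN step that writes to the image's real conf path.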

Test plan

  • Rebuild Spark image (cd docker/spark && make build)
  • Push image to registry
  • Deploy CE with updated image
  • Run test_mlrun_spark_operator_using_set_function to verify Spark jobs can read from SeaweedFS without ETag errors
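
Before deploying, it may be worth confirming that the rebuilt image actually carries the setting. The sketch below is one possible check, not part of the PR: get_spark_conf is a hypothetical helper, it assumes whitespace-separated "key value" lines, and it runs against a throwaway file rather than the real image.

```shell
#!/bin/sh
# Print the value configured for a property in a spark-defaults.conf style file.
# Assumes whitespace-separated "key value" lines; in this sketch the last
# matching line wins.
get_spark_conf() {
    awk -v key="$2" '$1 == key { value = $2 } END { if (value != "") print value }' "$1"
}

# Demo against a throwaway file; in practice, point this at the conf file
# copied out of (or inspected inside) the built image.
DEMO_CONF="$(mktemp)"
printf '%s\n' \
    '# comment lines never match the key, so they are skipped' \
    'spark.hadoop.fs.s3a.change.detection.mode none' > "$DEMO_CONF"

MODE="$(get_spark_conf "$DEMO_CONF" spark.hadoop.fs.s3a.change.detection.mode)"
if [ "$MODE" = "none" ]; then
    echo "change detection disabled"
else
    echo "change detection NOT disabled (mode='$MODE')" >&2
fi
```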

Fixes: CEML-669

🤖 Generated with Claude Code

…ML-669)

SeaweedFS does not return ETags in the format expected by Hadoop's S3A
change detection policy, causing Spark jobs to fail with
NoVersionAttributeException. Disable ETag-based change detection in
spark-defaults.conf.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

RUN mkdir /home/spark && chown spark /home/spark

# Disable S3A ETag-based change detection — SeaweedFS does not return ETags
Collaborator

@alxtkr77, is there a way to add this to the Spark config at runtime, for example by using spark_conf.update?
If not, we need to explain to users that they must build this image before using Spark in CE, right?

Member Author

@GiladShapira94, there is no need for users to build the image. We will build it and push it to gcr, and the image will be pulled during CE installation.
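
On the runtime question: Spark in general copies any property prefixed with spark.hadoop. into the Hadoop configuration, so the same setting can be passed per job with spark-submit's --conf flag. The thread does not settle whether this is how CE jobs should be configured, so the following is only an illustrative sketch; my_job.py is a hypothetical application file, and the command is printed rather than executed.

```shell
#!/bin/sh
# Build the --conf argument that passes the Hadoop S3A setting through Spark.
# Properties prefixed with "spark.hadoop." are forwarded to the Hadoop configuration.
S3A_CONF="spark.hadoop.fs.s3a.change.detection.mode=none"

# my_job.py is a hypothetical application file used only for illustration.
SUBMIT_CMD="spark-submit --conf $S3A_CONF my_job.py"

# Print rather than run, so the sketch works without a Spark installation.
echo "$SUBMIT_CMD"
```

Baking the value into spark-defaults.conf, as this PR does, has the advantage that every job picks it up without per-job configuration.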

@alxtkr77 alxtkr77 merged commit 3d69d03 into mlrun:development Mar 19, 2026
2 checks passed