Skip to content

[SUPPORT] cleaner service is not very well documented #5975

@bithw1

Description

@bithw1

Hi, I am reading at https://hudi.apache.org/docs/hoodie_cleaner,

The usage of utility tool HoodieCleaner is not very well documented, it gives an example

[hoodie]$ spark-submit --class org.apache.hudi.utilities.HoodieCleaner \
  --props s3:///temp/hudi-ingestion-config/kafka-source.properties \
  --target-base-path s3:///temp/hudi \
  --spark-master yarn-cluster

but I don't know the content of the prop file:kafka-source.properties

I would like to know how to specify the configuration property, that I want to keep the latest 3 data versions when I run the cleaner

Thanks a lot

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions