
[SUPPORT] I have some questions about Hudi clustering #2652

@shenbinglife

Description

  1. Does the mapping [<key, partitionpath> -> fileGroupId] change after clustering? That is, can a record end up written to a different file group?
  2. Clustering sorts the data by columns. Does it move a record to a different physical location, one that is not under its partition path, by using inlinefs?
  3. Does clustering work on the full Hudi table, or can we choose specific partitions?
  4. Why does clustering ignore files whose size exceeds targetFileSize? If such a file is ignored, we still have to spend time on a full scan of it.
  5. When some files are being compacted, does the clustering scheduler ignore those files? And when the clustering job then runs, does it still ignore them even if their compaction has completed?
