Implement Radix Hash Join

### Is your feature request related to a problem or challenge?

Hash Joins with large build side & lots of hash-duplicates are relatively slow in DataFusion.

The cost seems largely associated with traversing the chain of duplicates (`chain_traverse`) (1) + which is known to be very cache-inefficient, as the access pattern is mostly random.

Currently, we implement hash joins partitioned by hash, but we can implement a more efficient algorithm (radix hash join) that splits build data into smaller tables that individually mostly fits in CPU caches and allow more efficient access patterns.

_[TODO: collect some issues / examples]_

(1) https://github.com/apache/datafusion/issues/17494

### Describe the solution you'd like

Implement a version of Radix Hash Joins:

<img width="413" height="282" alt="Image" src="https://github.com/user-attachments/assets/14db8f23-a386-4e12-8b16-57ebe9823cad" />

https://15721.courses.cs.cmu.edu/spring2016/papers/balkesen-icde2013.pdf


### Describe alternatives you've considered

_No response_

### Additional context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement Radix Hash Join #18939

Is your feature request related to a problem or challenge?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Implement Radix Hash Join #18939

Description

Is your feature request related to a problem or challenge?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions