Skip to content

Hash join operator (replace nested-loop for large tables) #28

@renecannao

Description

@renecannao

Current JoinOperator uses nested-loop: for each left row, scan all right rows. O(n*m) complexity.

Hash join: build hash table on smaller side, probe with larger side. O(n+m) complexity.

Important for distributed queries where large result sets are joined locally after fetching from multiple backends.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions