Skip to content

[GLUTEN-10660][VL] Fix passing Velox session configurations on hash build optimizations#11134

Merged
philo-he merged 4 commits intoapache:mainfrom
zhouyuan:wip_fix_velox_config
Dec 6, 2025
Merged

[GLUTEN-10660][VL] Fix passing Velox session configurations on hash build optimizations#11134
philo-he merged 4 commits intoapache:mainfrom
zhouyuan:wip_fix_velox_config

Conversation

@zhouyuan
Copy link
Copy Markdown
Member

@zhouyuan zhouyuan commented Nov 20, 2025

What changes are proposed in this pull request?

some important configurations on Velox hashmap dedup are not passed to Velox backend

How was this patch tested?

locally verified

Related issue: #10660

@github-actions github-actions Bot added CORE works for Gluten Core VELOX labels Nov 20, 2025
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@zhouyuan zhouyuan changed the title [VL] Fix passing Velox session configurations on hash build optimizations [GLUTEN-10660][VL] Fix passing Velox session configurations on hash build optimizations Nov 20, 2025
.foreach(entry => nativeConfMap.put(entry._1, entry._2))

// Backend's dynamic session conf only.
val confPrefix = prefixOf(backendName)
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you change this method name from prefixOf to sessionPrefixOf?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The code should be work for the hash build dedup feature. However for now Gluten actually does not have a clear boundary on session level vs. static config, I guess we may need some refactor on this part in future work.

std::to_string(veloxCfg_->get<int32_t>(kAbandonBuildNoDupHashMinRows, 100000));
configs[velox::core::QueryConfig::kAbandonBuildNoDupHashMinPct] =
std::to_string(veloxCfg_->get<int32_t>(kAbandonBuildNoDupHashMinPct, 0));
std::to_string(static_cast<int32_t>(veloxCfg_->get<double>(kAbandonBuildNoDupHashMinPct, 0)));
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Copy Markdown
Member Author

@zhouyuan zhouyuan Dec 4, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it's mostly due to the config name has a percentile so we change that to double based config when enable this feature. I have modified to use int based and update the comments

Signed-off-by: Yuan <yuanzhou@apache.org>
@zhouyuan zhouyuan force-pushed the wip_fix_velox_config branch from 4469d8b to 765c5c4 Compare December 4, 2025 14:07
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Dec 4, 2025

Run Gluten Clickhouse CI on x86

1 similar comment
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Dec 4, 2025

Run Gluten Clickhouse CI on x86

Signed-off-by: Yuan <yuanzhou@apache.org>
@zhouyuan zhouyuan force-pushed the wip_fix_velox_config branch from 97a0328 to 51bed04 Compare December 4, 2025 14:34
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Dec 4, 2025

Run Gluten Clickhouse CI on x86

Signed-off-by: Yuan <yuanzhou@apache.org>
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Dec 4, 2025

Run Gluten Clickhouse CI on x86

@github-actions
Copy link
Copy Markdown

github-actions Bot commented Dec 5, 2025

Run Gluten Clickhouse CI on x86

Copy link
Copy Markdown
Contributor

@beliefer beliefer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It looks good to me if the session configurations used by Velox.

@beliefer
Copy link
Copy Markdown
Contributor

beliefer commented Dec 5, 2025

The fail test seems not related to this PR. It is failed in other PR too.

@zhouyuan zhouyuan requested a review from wForget December 5, 2025 23:06
Copy link
Copy Markdown
Member

@philo-he philo-he left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks. Assume the CI failure is not related.

@philo-he philo-he merged commit 8c78bb0 into apache:main Dec 6, 2025
104 of 107 checks passed
@zhouyuan
Copy link
Copy Markdown
Member Author

zhouyuan commented Dec 8, 2025

@wForget @beliefer @philo-he thanks for the review.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CORE works for Gluten Core VELOX

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants