Optimize meta hook file scans #2106
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## master #2106 +/- ##
==========================================
+ Coverage 92.33% 92.35% +0.02%
==========================================
Files 120 120
Lines 24684 24772 +88
==========================================
+ Hits 22792 22879 +87
- Misses 1892 1893 +1
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 72352fb546
Pull request overview
This PR optimizes meta hook checks by switching from per-hook filename list collection to boolean “any match” scans, enabling early exit once all candidates match. It also refactors a few supporting APIs to make this scan style reusable across the run/install paths.
Changes:
- Update meta hook implementations to scan project files once and stop early when all relevant hooks/options have matched.
- Extend the shared project-file visitor to support early termination via `ControlFlow`.
- Refactor language support checking (`Language::supported`) into a method and adjust CLI/run code to use it; minor module re-exports/import cleanups.
Reviewed changes
Copilot reviewed 9 out of 9 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| crates/prek/src/languages/mod.rs | Refactors Language::supported into a self method used by callers. |
| crates/prek/src/hooks/meta_hooks.rs | Reworks meta hook scans to avoid allocating filename lists and to break early when done. |
| crates/prek/src/config.rs | Removes HookType::num_args from config (logic moved to CLI hook impl). |
| crates/prek/src/cli/run/run.rs | Uses early-break scanning when deciding which missing env hooks need installation; switches to language.supported(). |
| crates/prek/src/cli/run/mod.rs | Re-exports additional filter types; makes install module private while exposing InstallCache. |
| crates/prek/src/cli/run/filter.rs | Adds ControlFlow-based early-break visitor and exposes tag filtering helpers; removes by_type. |
| crates/prek/src/cli/install.rs | Updates InstallCache import to new re-export location. |
| crates/prek/src/cli/hook_impl.rs | Moves hook arg-count validation logic into this module via hook_num_args. |
| crates/prek/src/cli/cache_gc.rs | Updates InstallCache import to new re-export location. |
Comments suppressed due to low confidence (1)
crates/prek/src/cli/run/filter.rs:396
`collect_files_from_args` takes 8 parameters; removing the `#[allow(clippy::too_many_arguments)]` will likely make `cargo clippy -D warnings` fail. Either reintroduce the allow attribute or refactor the parameters into an options struct to keep clippy clean.
/// Collect files to run hooks on.
/// Returns a list of file paths relative to the git root.
async fn collect_files_from_args(
git_root: &Path,
workspace_root: &Path,
hook_stage: Stage,
from_ref: Option<String>,
to_ref: Option<String>,
all_files: bool,
files: Vec<String>,
directories: Vec<String>,
) -> Result<Vec<PathBuf>> {
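
The options-struct refactor suggested above could look roughly like the following. This is only a sketch under stated assumptions: `CollectOptions`, the stand-in `Stage` enum, and `collect_files_sketch` are hypothetical names, the function is synchronous for brevity, and its body is a placeholder that only resolves explicitly listed files. It is not the project's actual implementation.

```rust
use std::path::{Path, PathBuf};

/// Hypothetical stand-in for the project's `Stage` type.
#[allow(dead_code)]
#[derive(Debug, Clone, Copy, PartialEq, Eq, Default)]
enum Stage {
    #[default]
    PreCommit,
    PrePush,
}

/// Hypothetical options struct grouping the six selection parameters,
/// so the function takes 3 arguments instead of 8.
#[allow(dead_code)]
#[derive(Debug, Default)]
struct CollectOptions {
    hook_stage: Stage,
    from_ref: Option<String>,
    to_ref: Option<String>,
    all_files: bool,
    files: Vec<String>,
    directories: Vec<String>,
}

/// Placeholder body: joins each explicitly listed file onto the workspace
/// root and returns it relative to the git root. Ref ranges, `all_files`,
/// and directories are deliberately not handled in this sketch.
fn collect_files_sketch(
    git_root: &Path,
    workspace_root: &Path,
    opts: &CollectOptions,
) -> Vec<PathBuf> {
    opts.files
        .iter()
        .map(|f| workspace_root.join(f))
        .filter_map(|p| p.strip_prefix(git_root).ok().map(Path::to_path_buf))
        .collect()
}
```

Callers can then set only the fields they need and fill the rest with `..Default::default()`, which keeps call sites short and stays under clippy's default argument-count threshold without an allow attribute.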
📦 Cargo Bloat Comparison
Binary size change: +0.00% (26.1 MiB → 26.1 MiB)
⚡️ Hyperfine Benchmarks
Summary: 1 regression, 1 improvement above the 10% threshold.
CLI Commands
Benchmarking basic commands in the main repo:
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base --version` | 2.3 ± 0.1 | 2.2 | 3.1 | 1.02 ± 0.06 |
| `prek-head --version` | 2.2 ± 0.1 | 2.0 | 2.6 | 1.00 |
prek list
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base list` | 9.0 ± 0.3 | 8.7 | 11.6 | 1.02 ± 0.05 |
| `prek-head list` | 8.8 ± 0.3 | 8.6 | 11.3 | 1.00 |
prek validate-config .pre-commit-config.yaml
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base validate-config .pre-commit-config.yaml` | 3.1 ± 0.1 | 3.0 | 3.6 | 1.07 ± 0.03 |
| `prek-head validate-config .pre-commit-config.yaml` | 2.9 ± 0.0 | 2.8 | 3.1 | 1.00 |
prek sample-config
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base sample-config` | 2.5 ± 0.1 | 2.4 | 2.7 | 1.06 ± 0.06 |
| `prek-head sample-config` | 2.4 ± 0.1 | 2.3 | 3.2 | 1.00 |
Cold vs Warm Runs
Comparing first run (cold) vs subsequent runs (warm cache):
prek run --all-files (cold - no cache)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run --all-files` | 141.5 ± 5.2 | 137.0 | 155.3 | 1.01 ± 0.04 |
| `prek-head run --all-files` | 139.8 ± 2.6 | 137.0 | 144.8 | 1.00 |
prek run --all-files (warm - with cache)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run --all-files` | 139.1 ± 2.3 | 136.2 | 143.2 | 1.00 |
| `prek-head run --all-files` | 140.9 ± 2.4 | 136.9 | 145.7 | 1.01 ± 0.02 |
Full Hook Suite
Running the builtin hook suite on the benchmark workspace:
prek run --all-files (full builtin hook suite)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run --all-files` | 140.4 ± 2.6 | 134.3 | 146.2 | 1.00 |
| `prek-head run --all-files` | 141.4 ± 3.1 | 137.3 | 152.4 | 1.01 ± 0.03 |
Individual Hook Performance
Benchmarking each hook individually on the test repo:
prek run trailing-whitespace --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run trailing-whitespace --all-files` | 20.4 ± 0.4 | 19.8 | 21.7 | 1.00 |
| `prek-head run trailing-whitespace --all-files` | 27.9 ± 40.3 | 19.5 | 241.2 | 1.37 ± 1.98 |
prek run trailing-whitespace --all-files: 36.8200% slower
prek run end-of-file-fixer --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run end-of-file-fixer --all-files` | 26.1 ± 1.6 | 23.5 | 29.5 | 1.00 |
| `prek-head run end-of-file-fixer --all-files` | 26.4 ± 2.1 | 23.4 | 31.5 | 1.01 ± 0.10 |
prek run check-json --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run check-json --all-files` | 11.8 ± 0.3 | 11.2 | 12.5 | 1.02 ± 0.03 |
| `prek-head run check-json --all-files` | 11.6 ± 0.2 | 11.2 | 12.2 | 1.00 |
prek run check-yaml --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run check-yaml --all-files` | 11.0 ± 0.3 | 10.8 | 12.3 | 1.03 ± 0.03 |
| `prek-head run check-yaml --all-files` | 10.7 ± 0.1 | 10.6 | 11.4 | 1.00 |
prek run check-toml --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run check-toml --all-files` | 11.1 ± 0.3 | 10.6 | 12.2 | 1.04 ± 0.04 |
| `prek-head run check-toml --all-files` | 10.7 ± 0.3 | 10.2 | 11.3 | 1.00 |
prek run check-xml --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run check-xml --all-files` | 11.0 ± 0.2 | 10.6 | 11.4 | 1.01 ± 0.03 |
| `prek-head run check-xml --all-files` | 10.9 ± 0.3 | 10.5 | 11.5 | 1.00 |
prek run detect-private-key --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run detect-private-key --all-files` | 17.1 ± 1.3 | 15.3 | 20.0 | 1.03 ± 0.11 |
| `prek-head run detect-private-key --all-files` | 16.7 ± 1.3 | 15.1 | 19.3 | 1.00 |
prek run fix-byte-order-marker --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run fix-byte-order-marker --all-files` | 21.9 ± 1.6 | 19.5 | 24.7 | 1.02 ± 0.11 |
| `prek-head run fix-byte-order-marker --all-files` | 21.5 ± 1.6 | 18.4 | 24.9 | 1.00 |
Installation Performance
Benchmarking hook installation (fast path hooks skip Python setup):
prek install-hooks (cold - no cache)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base install-hooks` | 4.6 ± 0.1 | 4.5 | 4.7 | 1.06 ± 0.02 |
| `prek-head install-hooks` | 4.3 ± 0.0 | 4.3 | 4.4 | 1.00 |
prek install-hooks (warm - with cache)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base install-hooks` | 4.5 ± 0.0 | 4.5 | 4.6 | 1.06 ± 0.01 |
| `prek-head install-hooks` | 4.3 ± 0.0 | 4.3 | 4.3 | 1.00 |
File Filtering/Scoping Performance
Testing different file selection modes:
prek run (staged files only)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run` | 47.0 ± 0.7 | 46.0 | 48.8 | 1.00 |
| `prek-head run` | 47.0 ± 0.7 | 46.1 | 48.7 | 1.00 ± 0.02 |
prek run --files '*.json' (specific file type)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run --files '*.json'` | 7.9 ± 0.1 | 7.8 | 8.0 | 1.01 ± 0.02 |
| `prek-head run --files '*.json'` | 7.8 ± 0.1 | 7.6 | 8.2 | 1.00 |
Workspace Discovery & Initialization
Benchmarking hook discovery and initialization overhead:
prek run --dry-run --all-files (measures init overhead)
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run --dry-run --all-files` | 10.1 ± 0.1 | 9.9 | 10.4 | 1.01 ± 0.02 |
| `prek-head run --dry-run --all-files` | 10.0 ± 0.2 | 9.8 | 10.5 | 1.00 |
Meta Hooks Performance
Benchmarking meta hooks separately:
prek run check-hooks-apply --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run check-hooks-apply --all-files` | 12.4 ± 0.2 | 12.2 | 12.9 | 1.05 ± 0.02 |
| `prek-head run check-hooks-apply --all-files` | 11.8 ± 0.1 | 11.6 | 12.0 | 1.00 |
prek run check-useless-excludes --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run check-useless-excludes --all-files` | 12.1 ± 0.3 | 11.2 | 12.3 | 1.15 ± 0.05 |
| `prek-head run check-useless-excludes --all-files` | 10.5 ± 0.3 | 10.3 | 11.7 | 1.00 |
✅ Performance improvement for prek run check-useless-excludes --all-files: 13.0400% faster
prek run identity --all-files
| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|---|---|---|---|---|
| `prek-base run identity --all-files` | 9.8 ± 0.1 | 9.6 | 10.1 | 1.00 |
| `prek-head run identity --all-files` | 10.1 ± 1.2 | 9.4 | 13.2 | 1.04 ± 0.12 |
This MR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [prek](https://github.com/j178/prek) | patch | `0.4.0` → `0.4.1` |

MR created with the help of [el-capitano/tools/renovate-bot](https://gitlab.com/el-capitano/tools/renovate-bot). **Proposed changes to behavior should be submitted there as MRs.**

---

### Release Notes

<details>
<summary>j178/prek (prek)</summary>

### [`v0.4.1`](https://github.com/j178/prek/blob/HEAD/CHANGELOG.md#041)

[Compare Source](j178/prek@v0.4.0...v0.4.1)

Released on 2026-05-20.

##### Enhancements

- Fix pre-push range after rebase ([#2089](j178/prek#2089))
- Prefer extensions over loose filename tags ([#2092](j178/prek#2092))
- Skip installs for hooks that will not run ([#2103](j178/prek#2103))

##### Performance

- Optimize meta hook file scans ([#2106](j178/prek#2106))
- Reduce run filtering allocations ([#2090](j178/prek#2090))

##### Contributors

- [@j178](https://github.com/j178)

</details>

---

### Configuration

📅 **Schedule**: (UTC)

- Branch creation
  - At any time (no schedule defined)
- Automerge
  - At any time (no schedule defined)

🚦 **Automerge**: Disabled by config. Please merge this manually once you are satisfied.

♻ **Rebasing**: Whenever MR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 **Ignore**: Close this MR and you won't be reminded about this update again.

---

- [ ] If you want to rebase/retry this MR, check this box

---

This MR has been generated by [Mend Renovate](https://github.com/renovatebot/renovate).
Optimize the meta hook checks so they scan project files for boolean matches instead of collecting per-hook filename lists.
The shared scan helper can now stop early once all candidates have matched.
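
As a rough illustration of the "any match" scan with early exit described here, the following sketch uses `std::ops::ControlFlow` from the standard library. The `Candidate` type, the suffix-based matching rule, and the `visit_files` and `scan_any_match` helpers are hypothetical and exist only for this example; the actual visitor and hook types in the codebase may look quite different.

```rust
use std::ops::ControlFlow;

/// Hypothetical stand-in for a hook or option whose pattern must match
/// at least one project file.
struct Candidate {
    suffix: &'static str,
    matched: bool,
}

/// Visits files in order and stops as soon as the visitor returns
/// `ControlFlow::Break`. Returns how many files were visited.
fn visit_files<'a, I, F>(files: I, mut visit: F) -> usize
where
    I: IntoIterator<Item = &'a str>,
    F: FnMut(&str) -> ControlFlow<()>,
{
    let mut visited = 0;
    for file in files {
        visited += 1;
        if visit(file).is_break() {
            break;
        }
    }
    visited
}

/// Records a boolean per candidate instead of collecting filename lists,
/// and breaks out of the scan once every candidate has matched.
fn scan_any_match(files: &[&str], candidates: &mut [Candidate]) -> usize {
    let mut remaining = candidates.iter().filter(|c| !c.matched).count();
    if remaining == 0 {
        return 0; // nothing left to look for, so skip the scan entirely
    }
    visit_files(files.iter().copied(), |file| {
        // Only test candidates that have not matched yet.
        for c in candidates.iter_mut().filter(|c| !c.matched) {
            if file.ends_with(c.suffix) {
                c.matched = true;
                remaining -= 1;
            }
        }
        if remaining == 0 {
            ControlFlow::Break(())
        } else {
            ControlFlow::Continue(())
        }
    })
}
```

With this shape, a scan over a large repository does no per-hook allocation and touches only as many files as it takes for every candidate to match once; in the worst case (some candidate never matches) it still visits every file.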