Releases · SuarezPM/apohara-codesearch

11 Jun 20:09

github-actions

v0.3.0

3b6055c

0.3.0 - 2026-06-11 Latest

Release Notes

Added

Five new tree-sitter grammars (Bash, Java, C, Ruby, C++) for the structural
extractor. The engine now supports 9 languages for symbol-level
indexing (function definitions, type declarations, imports/exports):
the historical set of Rust, TypeScript, Python, and Go, plus Bash, Java,
C, Ruby, and C++. Each new grammar ships its own parser, import
extractor, fuzz target, and a checked-in fixture under tests/fixtures/.
Kotlin is deferred to v0.4.0 per the plan's barred-entry rule (R-1.1):
tree-sitter-kotlin has no 0.23.x line on crates.io, so adding it would
break the workspace's tree-sitter version pin policy.
Corpus freezes for the v0.3.0 measurement (.omc/plans/.../corpus_freeze.rs):
- tests/fixtures/bench-corpus-frozen-A/: 22-file copy of
  examples/bench-corpus/ at the v0.2.0 commit, content-hash pinned.
- tests/fixtures/bench-corpus-frozen-B/queries.json: 10-query golden-test
  subset for the F3 default-flip measurement.
- The guard test crates/apohara-codesearch/tests/corpus_freeze.rs
  fails on any drift; refreezing requires a chore(bench): refreeze corpus X commit.
OpenSSF Scorecard audit (.omc/plans/apohara-codesearch-scorecard-audit.md):
the measured aggregate is 7.0/10 (the 5.8 figure in CLAUDE.md was
stale). 9 of 18 checks score 10/10 and 4 score between 0 and 4. The
QW-2 fix pins cargo-audit to the Cargo.lock version (ee8b06a). QW-1
(Maintained) is a structural repo-age penalty that resolves itself
after 90 days. The QW-3 re-score showed no immediate change (Scorecard
needs 24-48h to re-index); +2 is expected once indexed.
F3 BENCHMARK baseline (BENCHMARK.md v0.3.0 section): the v0.2.0
hybrid-search baseline on the frozen corpus A, reported as
recall@5 / recall@10 / MRR, is BM25 0.542 / 0.625 / 0.326, vector
0.083 / 0.083 / 0.063, and hybrid 0.458 / 0.542 / 0.285. Hybrid scores
below the best single mode on 9 of 24 queries (38%). The bench surface
cannot measure the proposed-flip variants directly (see "Changed" below
for why the flips are deferred).
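
For readers unfamiliar with the metrics quoted in the F3 baseline, the following is a minimal sketch of how recall@k and MRR are conventionally computed. It is not taken from the bench-search harness: the function names and the use of u32 chunk ids are assumptions made for illustration only.

```rust
use std::collections::HashSet;

/// Fraction of the relevant ids that appear in the top `k` ranked results.
/// (Hypothetical helper; ids are plain u32 values for illustration.)
fn recall_at_k(ranked: &[u32], relevant: &HashSet<u32>, k: usize) -> f64 {
    if relevant.is_empty() {
        return 0.0;
    }
    let hits = ranked
        .iter()
        .take(k)
        .filter(|id| relevant.contains(*id))
        .count();
    hits as f64 / relevant.len() as f64
}

/// Reciprocal of the 1-based rank of the first relevant result,
/// or 0.0 when no relevant result was returned.
fn reciprocal_rank(ranked: &[u32], relevant: &HashSet<u32>) -> f64 {
    ranked
        .iter()
        .position(|id| relevant.contains(id))
        .map_or(0.0, |i| 1.0 / (i as f64 + 1.0))
}

/// Mean reciprocal rank over a batch of (ranking, relevant-set) pairs.
fn mean_reciprocal_rank(runs: &[(Vec<u32>, HashSet<u32>)]) -> f64 {
    if runs.is_empty() {
        return 0.0;
    }
    let total: f64 = runs
        .iter()
        .map(|(ranked, relevant)| reciprocal_rank(ranked, relevant))
        .sum();
    total / runs.len() as f64
}
```

Under these definitions, a query counts toward "hybrid below the best single mode" when its hybrid score is lower than the better of its BM25 and vector scores; which per-query metric the release used for that comparison is not stated in the notes.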

Changed

No default flips this cycle. Per the v0.3.0 plan
(.omc/plans/apohara-codesearch-3frentes.md §6), the proposed
adaptive=true / diversify=true default flips require a data-driven
positive-lift measurement. The bench-search harness cannot produce one,
because both opt-ins live in the server-side search_code wrapper, not
in the indexer-level rrf_fuse. F3-FLIP-CHECK therefore has no data
against which to apply the split criteria, and Pablo chose to defer
both flips to v0.4.0, together with the plumbing needed to measure them
server-side. As a result, the v0.3.0 release is focused on structural
extraction, not ranking. The rollback path for the flips is documented
in §10 of the plan and remains valid for the v0.4.0 measurement.
legacy.rb fixture renamed to legacy.foo in
examples/demo-repo/. Reason: the v0.3.0 grammar expansion
means .rb is now a parsed language; the ac4 integration test
needed an extension no grammar recognizes. The file's content is
unchanged.
test_detect_language_c updated to reflect that C++ extensions
(.cpp/.hpp/.cc/.cxx/.hxx/.hh) now map to Language::Cpp
instead of returning None. The C vs C++ split follows the
tree-sitter convention (one grammar per major).
Module symbol kind added to SymbolKind enum (Ruby module
declaration support).
Workspace tree-sitter dep set expanded: tree-sitter-bash,
tree-sitter-java, tree-sitter-c, tree-sitter-ruby,
tree-sitter-cpp (all at 0.23.x to match the existing pin).
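
The extension mapping described above can be sketched as follows. This is an illustration only, not the project's code: the enum variants other than Language::Cpp, the function signature, and every extension apart from the six C++ ones and .rb listed in these notes are assumptions.

```rust
use std::path::Path;

/// Hypothetical mirror of the nine supported languages.
#[allow(dead_code)]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Language {
    Rust,
    TypeScript,
    Python,
    Go,
    Bash,
    Java,
    C,
    Ruby,
    Cpp,
}

/// Map a file extension to a grammar; unknown extensions yield None.
fn detect_language(path: &Path) -> Option<Language> {
    let ext = path.extension()?.to_str()?;
    Some(match ext {
        // Assumed extensions for the historical set.
        "rs" => Language::Rust,
        "ts" => Language::TypeScript,
        "py" => Language::Python,
        "go" => Language::Go,
        // Assumed extensions for three of the new grammars.
        "sh" => Language::Bash,
        "java" => Language::Java,
        "c" => Language::C,
        // Stated in the release notes: .rb is now a parsed language.
        "rb" => Language::Ruby,
        // Stated in the release notes: these six map to C++.
        "cpp" | "hpp" | "cc" | "cxx" | "hxx" | "hh" => Language::Cpp,
        _ => return None,
    })
}
```

With a mapping of this shape, legacy.rb resolves to Ruby while legacy.foo resolves to None, which is why the fixture needed an extension that no grammar recognizes.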

Notes

Binary size on linux-x64: +7.99 MB (+62.58%) vs v0.2.0. Each new
tree-sitter grammar adds roughly 0.5-3.5 MB to the statically linked
binary. The generated C parser tables are the dominant cost. Java was
unexpectedly the smallest at +0.43 MB, Ruby came in at +2.05 MB, and
C++ at +3.45 MB. Pablo approved "all 6 grammars default" at the
size-budget gate (the cumulative projection was revised from +60% to
+62.58% as the actual measurements came in). The v0.3.0 plan's
C++/SACRED resolution still applies: the windows-msvc artifact has a
+20% budget; if the windows-msvc build exceeds it, C++ becomes
per-target default = [] and opt-in via cargo build --features cpp.
This must be verified at the F3-RELEASE / CI step.
OpenSSF Scorecard: 7.0/10 baseline measured. The 3 quick wins
approved by Pablo (pin cargo-audit) are committed. No further
Scorecard work in this release; the audit doc remains the source
of truth for follow-ups.
Kotlin is deferred to v0.4.0; see the first item under "Added".

Download apohara-codesearch 0.3.0

File	Platform	Checksum
apohara-codesearch-aarch64-apple-darwin.tar.xz	Apple Silicon macOS	checksum
apohara-codesearch-x86_64-apple-darwin.tar.xz	Intel macOS	checksum
apohara-codesearch-x86_64-pc-windows-msvc.zip	x64 Windows	checksum
apohara-codesearch-aarch64-unknown-linux-gnu.tar.xz	ARM64 Linux	checksum
apohara-codesearch-x86_64-unknown-linux-gnu.tar.xz	x64 Linux	checksum

Verifying GitHub Artifact Attestations

The artifacts in this release have attestations generated with GitHub Artifact Attestations. These can be verified by using the GitHub CLI:

gh attestation verify <file-path of downloaded artifact> --repo SuarezPM/apohara-codesearch

You can also download the attestation from GitHub and verify against that directly:

gh attestation verify <file-path of downloaded artifact> --bundle <file-path of downloaded attestation>

Assets 16

File	SHA-256	Size	Uploaded (UTC)
apohara-codesearch-aarch64-apple-darwin.tar.xz	sha256:4e6aa4ba7c5d6a095cb3d531e8429be37dfe80401328d8024f5679ad26f05882	2.68 MB	2026-06-11T20:09:38Z
apohara-codesearch-aarch64-apple-darwin.tar.xz.sha256	sha256:d8719266ca43de2daed13680d1635a3afb099289f2e1a8a7716bb78eea481c9d	114 Bytes	2026-06-11T20:09:38Z
apohara-codesearch-aarch64-unknown-linux-gnu.tar.xz	sha256:7220574171fc758dd2d56c419c6fb9649de5be257a9be6d1693f1f403e0ed85f	2.82 MB	2026-06-11T20:09:38Z
apohara-codesearch-aarch64-unknown-linux-gnu.tar.xz.sha256	sha256:78869d713e7d83a0d179880af6d8d90b687fdeab56f86f9d5b9e6a8c7f9225f4	119 Bytes	2026-06-11T20:09:38Z
apohara-codesearch-x86_64-apple-darwin.tar.xz	sha256:80154d285a65b646e9685ee6132316f343b3075cba9413868931937364e0cf00	2.9 MB	2026-06-11T20:09:38Z
apohara-codesearch-x86_64-apple-darwin.tar.xz.sha256	sha256:de0bf48b04a38c649f338d0d9c5f1740046280d1bc30d7493eed25f16b64d6e2	113 Bytes	2026-06-11T20:09:39Z
apohara-codesearch-x86_64-pc-windows-msvc.zip	sha256:0ba7c3d7e51c87c30e2da2114f9d14e18ee609a79ebc03b3c967b18de7e57424	4.11 MB	2026-06-11T20:09:39Z
apohara-codesearch-x86_64-pc-windows-msvc.zip.sha256	sha256:efe639cdc27e7e740850fff7633f5f47062fa8d8f02206d90fba65086d7255ac	113 Bytes	2026-06-11T20:09:39Z
apohara-codesearch-x86_64-unknown-linux-gnu.tar.xz	sha256:3d071af184b48eb11ed1d840386e4c75157bcca0c923f4f90eff693adea0b0c6	3.05 MB	2026-06-11T20:09:39Z
apohara-codesearch-x86_64-unknown-linux-gnu.tar.xz.sha256	sha256:8a7a11df44cb00df5c445e8bb67669b7427a9da9fc3b79990f731b4563c5cf01	118 Bytes	2026-06-11T20:09:39Z
Source code (zip)			2026-06-11T19:39:54Z
Source code (tar.gz)			2026-06-11T19:39:54Z

08 Jun 00:31

github-actions

v0.2.0

d442c73

0.2.0 - 2026-06-07

Release Notes

Added

Real EmbeddingGemma embedder in pure candle (opt-in gguf-embed): a
from-scratch forward pass of EmbeddingGemma-300m (Gemma3 encoder + dense head)
in candle 0.10 with no native dependencies, loading user-supplied
safetensors weights from a local path — never downloaded. Validated to cosine
0.99998 vs the official ONNX reference; 256-d Matryoshka output, asymmetric
query/document prompts. On CodeSearchNet the vector arm goes from feature-hash
noise (recall@5 0.34/0.005/0.035) to 0.95/0.99/0.885, hybrid now beats
BM25-only on all three slices, and the adaptive recovery gate closes — see
BENCHMARK.md. The default build is unchanged: still the
deterministic feature-hash, still zero-model and offline.
Project governance & OpenSSF Best Practices artifacts:
CONTRIBUTING.md, CODE_OF_CONDUCT.md
(Contributor Covenant 3.0), GOVERNANCE.md, this changelog,
docs/ASSURANCE.md (assurance case), and
docs/best-practices-silver.md (criteria
evidence map).
Supply-chain / OpenSSF Scorecard hardening: cargo-deny + cargo-audit
jobs, a Dependabot config (Dependency-Update-Tool), a CodeQL workflow for
Rust + Actions (SAST), all GitHub Actions pinned to commit SHAs
(Pinned-Dependencies), and least-privilege top-level contents: read token
permissions across every workflow (Token-Permissions), with write elevated
per-job only where the release is created/published.
Fuzzing: cargo-fuzz targets over the untrusted-input surface
(parse_source + chunk_file) plus a ClusterFuzzLite setup that runs them
on PRs (Scorecard Fuzzing). The fuzz/ crate is isolated from the main
workspace.
Registry publishing: release.yml now publishes the crates to crates.io
(cargo publish, indexer first then bin) and the npx wrapper to npm
(Scorecard Packaging). Adds the crate metadata crates.io requires
(description/keywords/categories/repository).
Branch protection on main: PRs + strict status checks (CI, CodeQL, deny,
audit, offline-isolation) + linear history + no force-push, enforced for admins.
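
The 256-d Matryoshka output and the cosine comparison mentioned in the embedder item can be illustrated with a short sketch. It assumes the usual Matryoshka recipe (keep the leading components of a longer vector, then rescale to unit length); the release notes do not say that the embedder is implemented this way, and both function names are hypothetical.

```rust
/// Keep the first `dim` components and rescale the result to unit L2 norm.
/// (Assumed Matryoshka-style truncation, for illustration only.)
fn truncate_and_normalize(embedding: &[f32], dim: usize) -> Vec<f32> {
    let mut v: Vec<f32> = embedding.iter().take(dim).copied().collect();
    let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm > 0.0 {
        for x in &mut v {
            *x /= norm;
        }
    }
    v
}

/// Cosine similarity of two equal-length vectors; 0.0 if either is all zeros.
fn cosine(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if na == 0.0 || nb == 0.0 {
        0.0
    } else {
        dot / (na * nb)
    }
}
```

A validation figure such as "cosine 0.99998 vs the official ONNX reference" would then come from calling a function like cosine on the two implementations' outputs for the same input text.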

Changed

chunks_vec width is parametrized by the active embedder's dimension
(open_db_with(path, dim)), decoupling the vector-table DDL from the
EMBED_DIM = 384 feature-hash constant so an opt-in model with a different
dimension (e.g. EmbeddingGemma 256/768) stores correctly. The default path
(open_db) stays byte-identical; the existing refuse-to-mix guard rejects an
index built with a different embedder id/dim.
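
A rough sketch of the two ideas in this change, under stated assumptions: the vector-table DDL takes its width from the active embedder instead of a constant, and a guard refuses to open an index whose recorded embedder differs. The struct, the error type, the function names, the embedder ids, and the sqlite-vec style DDL string are all hypothetical; the notes do not say which vector extension or schema the project uses.

```rust
/// Hypothetical record of which embedder built an index.
#[derive(Debug, Clone, PartialEq, Eq)]
struct EmbedderMeta {
    id: String,
    dim: usize,
}

/// Hypothetical error returned when embedders would be mixed.
#[derive(Debug, PartialEq, Eq)]
enum OpenError {
    EmbedderMismatch {
        stored: EmbedderMeta,
        active: EmbedderMeta,
    },
}

/// Build the vector-table DDL from the embedder dimension.
/// The vec0 syntax is an assumption, used here only as an example.
fn vec_table_ddl(dim: usize) -> String {
    format!("CREATE VIRTUAL TABLE chunks_vec USING vec0(embedding float[{dim}])")
}

/// Refuse to mix: a fresh index (no stored metadata) is accepted,
/// an existing one must match the active embedder's id and dimension.
fn check_embedder(
    stored: Option<&EmbedderMeta>,
    active: &EmbedderMeta,
) -> Result<(), OpenError> {
    match stored {
        None => Ok(()),
        Some(s) if s == active => Ok(()),
        Some(s) => Err(OpenError::EmbedderMismatch {
            stored: s.clone(),
            active: active.clone(),
        }),
    }
}
```

In this sketch the default path would pass dim = 384, so the generated DDL is the same string as before the change, while an opt-in 256-d model gets a 256-wide table and cannot be pointed at a 384-d index by accident.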

Download apohara-codesearch 0.2.0

File	Platform	Checksum
apohara-codesearch-aarch64-apple-darwin.tar.xz	Apple Silicon macOS	checksum
apohara-codesearch-x86_64-apple-darwin.tar.xz	Intel macOS	checksum
apohara-codesearch-x86_64-pc-windows-msvc.zip	x64 Windows	checksum
apohara-codesearch-aarch64-unknown-linux-gnu.tar.xz	ARM64 Linux	checksum
apohara-codesearch-x86_64-unknown-linux-gnu.tar.xz	x64 Linux	checksum

Verifying GitHub Artifact Attestations

The artifacts in this release have attestations generated with GitHub Artifact Attestations. These can be verified by using the GitHub CLI:

gh attestation verify <file-path of downloaded artifact> --repo SuarezPM/apohara-codesearch

You can also download the attestation from GitHub and verify against that directly:

gh attestation verify <file-path of downloaded artifact> --bundle <file-path of downloaded attestation>

Assets 16

06 Jun 19:16

github-actions

v0.2.0-rc.1

f20f31e

v0.2.0-rc.1 Pre-release

Pre-release

Download apohara-codesearch 0.2.0-rc.1

File	Platform	Checksum
apohara-codesearch-aarch64-apple-darwin.tar.xz	Apple Silicon macOS	checksum
apohara-codesearch-x86_64-apple-darwin.tar.xz	Intel macOS	checksum
apohara-codesearch-x86_64-pc-windows-msvc.zip	x64 Windows	checksum
apohara-codesearch-aarch64-unknown-linux-gnu.tar.xz	ARM64 Linux	checksum
apohara-codesearch-x86_64-unknown-linux-gnu.tar.xz	x64 Linux	checksum

Verifying GitHub Artifact Attestations

The artifacts in this release have attestations generated with GitHub Artifact Attestations. These can be verified by using the GitHub CLI:

gh attestation verify <file-path of downloaded artifact> --repo SuarezPM/apohara-codesearch

You can also download the attestation from GitHub and verify against that directly:

gh attestation verify <file-path of downloaded artifact> --bundle <file-path of downloaded attestation>

Assets 16

05 Jun 16:29

github-actions

v0.1.0

6d274a7

v0.1.0

Download apohara-codesearch 0.1.0

File	Platform	Checksum
apohara-codesearch-aarch64-apple-darwin.tar.xz	Apple Silicon macOS	checksum
apohara-codesearch-x86_64-apple-darwin.tar.xz	Intel macOS	checksum
apohara-codesearch-x86_64-pc-windows-msvc.zip	x64 Windows	checksum
apohara-codesearch-aarch64-unknown-linux-gnu.tar.xz	ARM64 Linux	checksum
apohara-codesearch-x86_64-unknown-linux-gnu.tar.xz	x64 Linux	checksum

Assets 16
