Skip to content

DM-001: Fix dimension mismatch β€” align all embeddings to 384#21

Merged
sparkling merged 1 commit intomainfrom
patch/DM-001
Mar 14, 2026
Merged

DM-001: Fix dimension mismatch β€” align all embeddings to 384#21
sparkling merged 1 commit intomainfrom
patch/DM-001

Conversation

@sparkling
Copy link
Copy Markdown
Owner

Fixes #20. Vector indexes (768), ONNX (384), neural (256) all mismatched. Aligned everything to 384-dim.

Vector indexes were hardcoded to 768-dim but ONNX MiniLM-L6 produces
384-dim embeddings. HNSW index could never be built β€” all searches
fell back to brute-force. Neural training was capped at 256-dim,
preventing neural patterns from sharing the memory index.

Fixes:
- vector_indexes schema: 768 β†’ 384 (matches MiniLM-L6 output)
- reasoningbank/agentic-flow fallback dimensions: 768 β†’ 384
- neural --dim max/default: 256 β†’ 384 (same dimension as memory)

Result: all embedding paths produce 384-dim vectors that match the
HNSW index configuration. HNSW can now be built and used.

Co-Authored-By: claude-flow <ruv@ruv.net>
@sparkling sparkling merged commit 76e7bf8 into main Mar 14, 2026
sparkling added a commit that referenced this pull request Mar 14, 2026
)

Supersedes PR #21 (which set 384-dim). The target machine has 32 cores
+ 187GB RAM β€” can easily run all-mpnet-base-v2 (110M params, ~5ms/embed).

Changes:
- ONNX model: all-MiniLM-L6-v2 (384-dim) β†’ all-mpnet-base-v2 (768-dim)
- Vector indexes: 384 β†’ 768 (now matches model output)
- Neural --dim: max/default 384 β†’ 768 (shares same HNSW index)
- All dimension fallbacks aligned to 768
- MCP tools: default model updated, MiniLM kept as enum option

All embedding paths now produce 768-dim vectors matching the HNSW index.

Co-Authored-By: claude-flow <ruv@ruv.net>
sparkling added a commit that referenced this pull request Mar 14, 2026
Patches applied:
- HK-002: fail-loud bridge errors in postEdit/postCommand/postTask (#18)
- HK-003: real metrics from sona-patterns/intelligence files (#19)
- HK-004: respect daemon.autoStart from settings.json (#20)
- HK-005: cross-process daemon PID-file guard (#21)
- NS-001: search/list default namespace 'default' β†’ 'all' (#22)
- NS-002: require explicit namespace for store/delete/retrieve (#23)
- NS-003: fix 'pattern' β†’ 'patterns' namespace typo (#24)
- WM-103: MetadataFilter + MMR diversity in search pipeline (#25)
- WM-104: CausalRecall integration in routing (#26)
- WM-105: MemoryGraph importance scoring in store/search (#27)
- WM-106: activate LearningBridge in intelligence_learn (#28)
- WM-107: fix falsy-OR quality bug, fake session ID/stats (#29)
- WM-114: wire AttentionService across 4 controllers (ruvnet#30)

Co-Authored-By: claude-flow <ruv@ruv.net>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

DM-001: Vector index dimension mismatch β€” 768 vs 384 breaks HNSW

1 participant