
fix(release): v0.2.2 — Critical Hotfix for config_override Initialization #4

Merged
yashdesai023 merged 8 commits into main from v0.2.2-fix on Mar 1, 2026

Conversation

@yashdesai023
Collaborator

Summary

This PR resolves 7 critical bugs that affected all users initializing VDBpipe via config_override (e.g., Google Colab users). It is a pure hotfix — no breaking changes.
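For context, a minimal sketch of the kind of configuration that triggered these bugs. The key names (`embedding.model_name`, `llm.provider`) are assumptions based on this PR's fix list, not the authoritative VDBpipe schema:

```python
# Hypothetical config_override for a no-LLM Colab run.
# Key names are illustrative assumptions, not the actual schema.
config_override = {
    "vector_db": {"provider": "faiss"},
    "embedding": {"model_name": "all-MiniLM-L6-v2"},
    "llm": {"provider": None},  # no LLM: engines fall back to non-LLM output
}
```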


What's Fixed

| Bug | Fix |
| --- | --- |
| `'NoneType' object has no attribute 'tokenize'` crash | `_safe_reinit` now unconditionally reinitializes the `Embedder` from the `embedding.model_name` key |
| LLM (Sarvam/Google/Cohere) not initialized via `config_override` | Added all missing LLM providers to `_safe_reinit` |
| Graph always 0 nodes after ingestion | Added `_regex_graph_extract()` as a no-LLM fallback using regex NLP |
| Corrupted PDF crash (`FzErrorFormat`) | `_load_pdf` now loads pages by index with per-page try/except |
| Engines 2/3 returning "LLM not configured" | All engines now return readable, useful fallback content without an LLM |
| Engine 3 returning irrelevant graph output | GraphRAG now filters edges by query keywords, with vector fallback |
| `generate_response()` signature mismatch | All engine calls now correctly pass the `retrieved_context` argument |
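The no-LLM graph fallback could look roughly like this. This is a guess at the approach behind `_regex_graph_extract()`, not the actual implementation: treat capitalized word runs as entities and link entities that co-occur in the same sentence.

```python
import re
from itertools import combinations

def regex_graph_extract(text):
    """Sketch of a regex-NLP graph fallback (assumed behaviour):
    capitalized word runs become nodes; entities co-occurring
    in one sentence become edges."""
    nodes, edges = set(), set()
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        entities = re.findall(r"\b[A-Z][a-zA-Z]+(?:\s+[A-Z][a-zA-Z]+)*", sentence)
        nodes.update(entities)
        for a, b in combinations(sorted(set(entities)), 2):
            edges.add((a, b))
    return nodes, edges
```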

Testing

  • Tested locally with faiss + all-MiniLM-L6-v2 (no LLM)
  • Tested in Google Colab with provider: null config
  • Tested with Sarvam LLM (sarvam-m)
  • All 4 engines verified working (Engines 2/3 via fallback, Engines 1/4 with LLM)
  • Corrupted PDF (economy-ketan-sir.pdf) ingested successfully with skipped bad pages

yashdesai023 merged commit e2f87aa into main on Mar 1, 2026
2 checks passed
yashdesai023 added a commit that referenced this pull request on Mar 3, 2026
…hunking, PPTX Loader, Backend VDBpipe Upgrade, 39-test Suite, TUI Diagnostics

BREAKING CHANGES: None — fully backwards compatible.

Architecture:
- Refactor VDBpipe to pure composition (remove TextPipeline inheritance, delete _safe_reinit)
- Replace TextPipeline with VDBpipe in backend routers (ingest, chat, retrieve)

Semantic OmniRouter (#3):
- Embedding cosine-similarity intent routing with threshold=0.35
- Pre-computed intent prototype embeddings per engine at startup
- Keyword fallback when embedder unavailable
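A minimal sketch of the cosine-similarity routing described above. Engine names and the prototype format are illustrative; only the 0.35 threshold and the idea of a non-embedding fallback default come from this changelog:

```python
import numpy as np

THRESHOLD = 0.35  # similarity floor from the changelog

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def route(query_vec, prototypes, default="vector"):
    """Pick the engine whose pre-computed intent prototype embedding
    is most similar to the query; fall back to `default` when no
    prototype clears the threshold."""
    best_engine, best_score = default, THRESHOLD
    for engine, proto in prototypes.items():
        score = cosine(query_vec, proto)
        if score > best_score:
            best_engine, best_score = engine, score
    return best_engine
```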

Persistence (#4):
- Auto-save graph + page_index as JSON after every ingest()
- Auto-load on VDBpipe.__init__() — survives restarts

Streaming (#15):
- BaseLLMProvider.stream_response() with safe default wrapper
- OpenAILLMProvider real SSE streaming (requests stream=True)
- VDBpipe.stream_query() generator
- POST /pipelines/chat/stream SSE endpoint (StreamingResponse)
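The "safe default wrapper" for `BaseLLMProvider.stream_response()` presumably reduces to something like this sketch (method signatures are assumed from the `retrieved_context` fix in this PR):

```python
class BaseLLMProvider:
    """Sketch only; the real base class carries more methods and config."""

    def generate_response(self, prompt, retrieved_context=""):
        raise NotImplementedError

    def stream_response(self, prompt, retrieved_context=""):
        # Safe default: providers without native streaming still "stream"
        # by yielding the full response as a single chunk.
        yield self.generate_response(prompt, retrieved_context)
```

An OpenAI-style provider would override `stream_response` to yield real SSE chunks; every other provider keeps working unchanged through the default.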

Data Loading (#13):
- Add PPTX support via python-pptx (_load_pptx)
- Register .pptx in DataLoader.supported_ext

Chunking (#14):
- Add chunk_text_sentences() sentence-boundary sliding-window chunker
- Configurable max_tokens and overlap_sentences
- Old chunk_text() kept for compatibility
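A sentence-boundary sliding-window chunker with these parameters could look like the sketch below (counting whitespace-separated words as tokens; the real `chunk_text_sentences()` may differ):

```python
import re

def chunk_text_sentences(text, max_tokens=100, overlap_sentences=1):
    """Sketch of a sentence-boundary sliding-window chunker: accumulate
    whole sentences until max_tokens is reached, then start the next
    chunk from the last overlap_sentences sentences."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, window = [], []
    for sent in sentences:
        window.append(sent)
        if sum(len(s.split()) for s in window) >= max_tokens:
            chunks.append(" ".join(window))
            window = window[-overlap_sentences:] if overlap_sentences else []
    if window:
        chunks.append(" ".join(window))
    return chunks
```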

Tests (#12):
- Expand from 4 to 39 tests across 12 test classes
- All tests mocked — no GPU/API keys required

TUI (#16, #17, #18):
- System Doctor: 6 real execSync runtime checks
- SetupWizard: setStep(8) on write error (fix silent failure)
- SetupWizard: validateAndSave() with per-provider API key validation

Bug Fixes:
- File isolation: uploads go to data/<user_id>/<uuid>_filename
- Cache eviction on config update in backend

Deps: add python-pptx>=0.6.23 to setup.py install_requires
TUI: bump to v0.1.4, smarter postinstall.cjs (python -m pip)
