Skip to content

Dev#14

Merged
zTgx merged 2 commits intomainfrom
dev
Apr 5, 2026
Merged

Dev#14
zTgx merged 2 commits intomainfrom
dev

Conversation

@zTgx
Copy link
Copy Markdown
Contributor

@zTgx zTgx commented Apr 5, 2026

No description provided.

zTgx added 2 commits April 5, 2026 22:57
- Add bm25 dependency with parallelism feature enabled
- Introduce comprehensive BM25 scoring module with per-field weighting support
- Implement configurable BM25 parameters (k1, b, avgdl) for fine-tuning
- Replace hardcoded BM25 parameters with Bm25Params struct
- Move stopwords and keyword extraction to shared bm25 module
- Add FieldWeights struct for different field importance (title, summary, content)
- Implement BM25 engine with fit-to-corpus functionality for accurate scoring
- Update RelevanceScorer to use new Bm25Params configuration

The new BM25 implementation provides:
- Per-field weighting for more accurate document scoring
- Configurable length normalization and term frequency saturation
- IDF caching for efficient scoring operations
- Query expansion support for enhanced search capabilities
- Update package version in Cargo.toml
@zTgx zTgx merged commit e0f3313 into main Apr 5, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant