Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch to FTS index divided by document fields #4

Merged
merged 2 commits into from Jan 29, 2022

Conversation

haroldadmin
Copy link
Owner

@haroldadmin haroldadmin commented Jan 29, 2022

This change allows the FTS index to index data per document property.

  • Index changes from Map<String, Map<Int, Int> (token to document frequencies) - to Map<String, Map<String, Map<Int, Int>> (token to property based document frequencies).
  • Store property-based document lengths for scoring.
  • Benchmarks confirm no noticeable loss of performance.
  • Unit tests confirm no loss of functionality.

Fixes #3.

- This change allows the FTS index to index data per document property.
- Index changes from `Map<String, Map<Int, Int>` (token to document frequencies) - to `Map<String, Map<String, Map<Int, Int>>` (token to property based document frequencies).
- Benchmarks confirm no noticeable loss of performance
- Unit tests confirm no loss of functionality

Fixes #3.
@haroldadmin haroldadmin added the enhancement New feature or request label Jan 29, 2022
@haroldadmin haroldadmin merged commit 2fb32f1 into main Jan 29, 2022
@haroldadmin haroldadmin deleted the fixes/per-field-indexing branch January 29, 2022 19:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
1.0
Awaiting triage
Development

Successfully merging this pull request may close these issues.

Divide the FTS index per document field
1 participant