Thank you for using txtai and trying it out on your data. It is common with similarity search to run into accuracy issues when the length of the content varies widely. Without knowing the exact data you're working with, here are some general ideas to try:
Try word embeddings instead of transformer models. Word embedding models offer different ways to weight the word vectors when averaging them together, such as BM25. BM25 factors in the length of the content as part of its scoring algorithm.
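To make the length point concrete, here is a minimal sketch of BM25 term weighting followed by a weighted average of word vectors. It is plain Python and is not txtai's implementation. The function names (`bm25_weights`, `weighted_average`), the toy vectors and the defaults `k1=1.2`, `b=0.75` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def bm25_weights(docs, k1=1.2, b=0.75):
    """Return one {term: weight} dict per tokenized document (illustrative only)."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    # Document frequency: number of documents that contain each term
    df = Counter(term for d in docs for term in set(d))

    weights = []
    for d in docs:
        tf = Counter(d)
        # Length normalization: longer-than-average documents get a larger penalty
        norm = k1 * (1 - b + b * len(d) / avgdl)
        weights.append({
            term: math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            * freq * (k1 + 1) / (freq + norm)
            for term, freq in tf.items()
        })
    return weights


def weighted_average(weights, vectors):
    """Average word vectors using the given term weights; skips unknown terms."""
    dims = len(next(iter(vectors.values())))
    total = sum(w for t, w in weights.items() if t in vectors)
    if total == 0:
        return [0.0] * dims
    return [
        sum(w * vectors[t][i] for t, w in weights.items() if t in vectors) / total
        for i in range(dims)
    ]


# Hypothetical toy data: one short and one long document sharing the term "search"
docs = [["similarity", "search"], ["search"] + ["filler"] * 20]
vectors = {"similarity": [1.0, 0.0], "search": [0.0, 1.0], "filler": [0.5, 0.5]}

weights = bm25_weights(docs)
short_vec = weighted_average(weights[0], vectors)
long_vec = weighted_average(weights[1], vectors)
```

In this toy data, "search" appears once in each document, but it receives a lower weight in the long one because of the length normalization term. That is the behavior that can help when content length varies a lot.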
The txtai library performs less accurately when the input texts being matched are too long.