Sorry if this is a silly request, but I have been following the release of the Jina ColBERT model and tried to use it as a drop-in replacement for ColBERT, mainly because of its longer document length.
I am not building indexes with RAGatouille, just reranking candidates with `RAG.rerank`. I get much, much worse evaluation results after that switch, so I assume that something isn't quite right here. Any guidance?
Reading the RAGatouille and ColBERT code, I see there are some "auto" parameters for max tokens and max document length. Does Jina require a different config, or did I simply misunderstand that it can replace ColBERT?
Thanks!
Thank you for flagging. I'm very short on time, so diagnosing this will take a little longer than usual, but the results shouldn't be drastically different. Initial evals showed relative parity between the two models, albeit only on 5-10 test cases.
I'm wondering if this is due to (1) a problem loading the model properly, or (2) something with the rank() function (it uses the same internal functions as other, more thoroughly tested functions, but a bug there is still possible).
What versions of ragatouille and colbert-ai are you on? Jina ColBERT only loads properly with colbert-ai >=0.2.19; previous versions of colbert-ai initialised the weights incorrectly.
If that's not the issue, would you mind sharing some example code/documents where the issue occurs? Thank you!
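For the version question above, a minimal sketch of one way to check what is installed, using only the standard library. The helper names (`release_tuple`, `installed_version`, `colbert_ai_ok`) are hypothetical and not part of RAGatouille or colbert-ai, and the comparison only looks at the leading numeric part of the version string, so treat it as a rough check rather than a full version parser.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Minimum colbert-ai version mentioned in this thread for Jina ColBERT.
MIN_COLBERT_AI = "0.2.19"


def release_tuple(v: str) -> tuple:
    """Leading numeric components of a version string, e.g. '0.2.19' -> (0, 2, 19)."""
    match = re.match(r"\d+(?:\.\d+)*", v)
    return tuple(int(part) for part in match.group().split(".")) if match else ()


def installed_version(dist: str):
    """Installed version string of a distribution, or None if it is not installed."""
    try:
        return version(dist)
    except PackageNotFoundError:
        return None


def colbert_ai_ok(installed, minimum: str = MIN_COLBERT_AI) -> bool:
    """True if the given colbert-ai version string is at least `minimum`."""
    return installed is not None and release_tuple(installed) >= release_tuple(minimum)


if __name__ == "__main__":
    for dist in ("ragatouille", "colbert-ai"):
        print(dist, installed_version(dist) or "not installed")
    print("colbert-ai new enough:", colbert_ai_ok(installed_version("colbert-ai")))
```

Running the script prints both installed versions, which is also the information worth pasting into a reply here.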