How is the input max_length computed in the BAAI/bge-reranker-v2-m3 model? #740
Comments
The upper bound of |
So the maximum input to the reranker model is 8192, and anything longer is truncated. But I found that the max_length parameter of compute_score defaults to 512.
A larger max_length lets the model process longer texts, but at a higher computational cost. The default is kept small, at 512, to speed up inference. If most of your text is long, we recommend using a larger |
So I should choose an appropriate max_length based on the length of the documents I am comparing in order to get better results.
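As a rough illustration of that idea, here is a minimal sketch that picks a max_length from the lengths of your own query/passage pairs and then passes it to compute_score. The helper `pick_max_length`, its `coverage` and `multiple` parameters, and the 95% heuristic are all hypothetical and not part of FlagEmbedding; the 8192 and 512 values are just the numbers mentioned in this thread. The token counts are assumed to come from the model's tokenizer and are given here as plain integers.

```python
import math

MODEL_LIMIT = 8192         # upper bound mentioned in this thread
DEFAULT_MAX_LENGTH = 512   # compute_score default mentioned in this thread


def pick_max_length(token_counts, coverage=0.95, multiple=64,
                    floor=DEFAULT_MAX_LENGTH):
    """Hypothetical helper: choose a max_length that fits `coverage`
    of the given pair lengths (in tokens) without truncation."""
    if not token_counts:
        return floor
    ordered = sorted(token_counts)
    # Index of the smallest length that covers the requested share of pairs.
    idx = min(len(ordered) - 1, math.ceil(coverage * len(ordered)) - 1)
    target = ordered[max(idx, 0)]
    # Round up to a multiple of `multiple`, then clamp to [floor, MODEL_LIMIT].
    rounded = math.ceil(target / multiple) * multiple
    return max(floor, min(rounded, MODEL_LIMIT))


def rerank(pairs, max_length):
    """Score [query, passage] pairs with an explicit max_length."""
    # Imported lazily so the helper above works without the library installed.
    from FlagEmbedding import FlagReranker

    reranker = FlagReranker("BAAI/bge-reranker-v2-m3", use_fp16=True)
    return reranker.compute_score(pairs, max_length=max_length)


if __name__ == "__main__":
    # Example token counts for ten query/passage pairs (made-up numbers).
    counts = [180, 240, 310, 420, 650, 700, 820, 900, 950, 1000]
    print(pick_max_length(counts))
```

Pairs longer than the chosen value are still truncated, so a higher `coverage` trades inference speed for fewer truncated inputs.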
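What exactly is the unit of this max_length = 512? Is it tokens or character length? And what happens when the input exceeds it? Is it simply truncated?
Does the reranker-v2-m3 model itself have an upper limit on max_length? Is this 512 adjustable (for example, raised to 1024 or 4096), or is changing it not recommended?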