-
Notifications
You must be signed in to change notification settings - Fork 542
[inference] Flipped usage of wait_for_model flag #649
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Wauplin
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good at first glance
|
will wait for others to confirm |
| const response = await (options?.fetch ?? fetch)(url, info); | ||
|
|
||
| if (options?.retry_on_error !== false && response.status === 503 && !options?.wait_for_model) { | ||
| if (options?.retry_on_error !== false && response.status === 503 && options?.wait_for_model) { |
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
|
you should pass (but yes we could probably remove |
|
@coyotte508 submitted #650, should I close this PR #649 then? |
|
yes you can close :) |
|
superceded by #650 |
from #649 (comment) > you should pass retry_on_error: false instead of wait_for_model: false in widgets, I think that would solve it > (but yes we could probably remove retry_on_error as some point and check wait_for_model for false, true or undefined) ### on main from @gary149 > 503 but no error appears in the UI? https://huggingface.co/microsoft/Phi-3-mini-128k-instruct?text=hi <img src="https://github.com/huggingface/huggingface.js/assets/11827707/c07ce78d-3e00-45e1-b07d-dfa709c2ebe5" width="200px"> ### this PR/branch https://github.com/huggingface/huggingface.js/assets/11827707/05cfd37b-fd15-482e-82ae-cdb76961707b
I think
wait_for_modelusage is flipped incorrectly in #148. Check files tabWhat prompted this error:
from @gary149
Explanation: although widget sends
wait_for_model: false, inference.js does NOT throw error and tries to load the model and make a call to this model once it is loaded