Check upon issue creation:
For XX in:
Parameters:
NEXT_MODEL_PATH=<org>/<model>
NEXT_MODEL_REVISION=main
NEXT_MODEL_PRECISION=float16
MAX_LENGTH=2048
GPU_MEMORY_UTILIZATION=0.8
VLLM_SWAP_SPACE=4
ToDos:
I'm hitting out-of-memory (OOM) errors here, which is really strange, since 8 H100 GPUs should suffice. I'll look into this further...
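For anyone reproducing this, here is a rough sketch of how the parameters listed in the issue could be turned into engine settings. The helper name `engine_kwargs_from_env` and the `num_gpus` argument are hypothetical and not taken from this project. The keys it returns are meant to line up with keyword arguments accepted by vLLM's `LLM` constructor, but whether this project wires them up that way, and whether it uses tensor parallelism across all 8 GPUs, is an assumption.

```python
import os


def engine_kwargs_from_env(env=None, num_gpus=8):
    """Map the issue's environment-style parameters to engine keyword arguments.

    Hypothetical helper: the variable names come from the issue, the
    mapping to vLLM-style keyword names is an assumption.
    """
    env = os.environ if env is None else env
    return {
        "model": env["NEXT_MODEL_PATH"],  # required, no default
        "revision": env.get("NEXT_MODEL_REVISION", "main"),
        "dtype": env.get("NEXT_MODEL_PRECISION", "float16"),
        "max_model_len": int(env.get("MAX_LENGTH", "2048")),
        # Fraction of each GPU's memory the engine is allowed to claim.
        "gpu_memory_utilization": float(env.get("GPU_MEMORY_UTILIZATION", "0.8")),
        # CPU swap space per GPU, in GiB.
        "swap_space": int(env.get("VLLM_SWAP_SPACE", "4")),
        # Assumption: shard the model across all GPUs on the node.
        "tensor_parallel_size": num_gpus,
    }


if __name__ == "__main__":
    # Values copied from the issue; the model path is left as the placeholder.
    issue_params = {
        "NEXT_MODEL_PATH": "<org>/<model>",
        "NEXT_MODEL_REVISION": "main",
        "NEXT_MODEL_PRECISION": "float16",
        "MAX_LENGTH": "2048",
        "GPU_MEMORY_UTILIZATION": "0.8",
        "VLLM_SWAP_SPACE": "4",
    }
    for key, value in engine_kwargs_from_env(issue_params).items():
        print(f"{key}={value!r}")
```

Printing the resolved values before launching makes it easier to confirm which settings were actually in effect when the OOM occurred, and to vary one of them at a time (for example `MAX_LENGTH` or `GPU_MEMORY_UTILIZATION`) while investigating.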