We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I only got a little improvement than the native code. Was there any I missed?
cli 1: time python generate.py --compile --compile_prefill --checkpoint_path /root/gpt-fast/codellama-34b-python/model_int8.pth --prompt "def quicksort(arr):" --max_new_tokens 32 --num_samples 50
cli 2: time python generate.py --checkpoint_path /root/gpt-fast/codellama-34b-python/model_int8.pth --prompt "def quicksort(arr):" --max_new_tokens 32 --num_samples 50
result of cli 1: 4.45tokens/sec & 151.52GB/s for bandwidth result of cli 2: 4.24tokens/sec & 144.55GB/s for bandwidth
relative improvement(compile vs not compile): speed: 4.9% memory bandwidth: 4.8%
gpu: 1*L40S docker: python:3.9 pytorch installation: pip install torch
The text was updated successfully, but these errors were encountered:
Are you using pytorch nightly? This perf seems much worse than I would expect
Sorry, something went wrong.
No branches or pull requests
I only got a little improvement than the native code. Was there any I missed?
Commands
cli 1:
time python generate.py --compile --compile_prefill --checkpoint_path /root/gpt-fast/codellama-34b-python/model_int8.pth --prompt "def quicksort(arr):" --max_new_tokens 32 --num_samples 50
cli 2:
time python generate.py --checkpoint_path /root/gpt-fast/codellama-34b-python/model_int8.pth --prompt "def quicksort(arr):" --max_new_tokens 32 --num_samples 50
Results
result of cli 1: 4.45tokens/sec & 151.52GB/s for bandwidth
result of cli 2: 4.24tokens/sec & 144.55GB/s for bandwidth
relative improvement(compile vs not compile):
speed: 4.9%
memory bandwidth: 4.8%
Env
gpu: 1*L40S
docker: python:3.9
pytorch installation: pip install torch
The text was updated successfully, but these errors were encountered: