Publish updated candle version on crates.io #1761
Comments
Hopefully we'll release 0.4.0 in the next few days.
Version 0.4.0 should now be available on crates.io!
Hi @LaurentMazare, I believe that the version published on crates.io doesn't include the latest upgrades (cuda inference for quantized models).
My system:
Default host: x86_64-unknown-linux-gnu
stable-x86_64-unknown-linux-gnu (default)
nvcc: NVIDIA (R) Cuda compiler driver
Thank you.
I don't think the |
@LaurentMazare you're right, the latest upgrades are included, sorry for the oversight. However, this line now fails: `let inv_freq = Tensor::from_vec(inv_freq, (1, inv_freq_len), dev)?.to_dtype(dtype)?` The problem arises only when the device is CUDA. With the previous version 0.3.3 everything works fine.
Upgrading the CUDA driver to >= 545 solves the issue.
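For anyone who wants to check their driver before upgrading, here is a minimal sketch in plain Rust (standard library only, no candle dependency). It assumes `nvidia-smi` is on the `PATH`. The helper names and the `MIN_DRIVER_MAJOR` constant are made up for illustration, and the 545 threshold is only the value reported in this thread, not an official requirement.

```rust
use std::process::Command;

/// Driver major version reported in this thread as fixing the issue (assumption).
const MIN_DRIVER_MAJOR: u32 = 545;

/// Extract the major version from a driver string such as "545.23.08".
fn driver_major(version: &str) -> Option<u32> {
    version.trim().split('.').next()?.parse().ok()
}

/// True when the major version of the driver string is at least MIN_DRIVER_MAJOR.
fn driver_is_recent_enough(version: &str) -> bool {
    driver_major(version).map_or(false, |major| major >= MIN_DRIVER_MAJOR)
}

/// Ask nvidia-smi for the driver version of the first listed GPU.
/// Returns None if nvidia-smi is missing or exits with an error.
fn query_driver_version() -> Option<String> {
    let output = Command::new("nvidia-smi")
        .args(["--query-gpu=driver_version", "--format=csv,noheader"])
        .output()
        .ok()?;
    if !output.status.success() {
        return None;
    }
    let text = String::from_utf8(output.stdout).ok()?;
    text.lines().next().map(|line| line.trim().to_string())
}
```

Calling `query_driver_version()` from `main` and passing the result to `driver_is_recent_enough` is enough to print a warning before creating a CUDA device.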
Hi all. In the last month several great updates have been released in candle, especially for quantized model inference (I have been waiting a long time for CUDA acceleration).
On crates.io the crate version is still stuck at 0.3.3. When is the new crate version scheduled to be published on crates.io?
Thank you very much.