About CUDA version and TRT version of 1.14.0 #870
Comments
Thanks for testing! If it works with an older CUDA you can keep using the older CUDA, but I've switched my testing to CUDA 12 going forward, so officially that's the only one that will be recommended. Also, when I upgraded my own machines from CUDA 11.4 to CUDA 12.1, I got about a 10% improvement on the CUDA backend, so there may be some benefit to upgrading even if an older version works.
Well, that's good news. I would like to upgrade to CUDA 12.1 and see how the speed changes.
I have prepared CUDA and CUDNN as recommended on the release page.
The following files were placed in the katago-cuda folder. I ran a benchmark test to compare it to my previous environment, and the speed improvement was only around 3%. Are the six files shown in the image above necessary and sufficient to run the newly released katago-cuda?
I said that the speedup is 3%, but it changes depending on the number of threads used, ranging from -1% to +11%.
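For anyone who wants to compare two benchmark runs per thread count the same way, here is a minimal sketch. It assumes throughput is recorded as visits per second for each thread count. The function names and the sample numbers are hypothetical, chosen only to illustrate the arithmetic; they are not measurements from this thread.

```python
def speedup_percent(old_visits_per_sec: float, new_visits_per_sec: float) -> float:
    """Relative change of the new throughput versus the old one, in percent."""
    if old_visits_per_sec <= 0:
        raise ValueError("old throughput must be positive")
    return (new_visits_per_sec / old_visits_per_sec - 1.0) * 100.0


def compare_by_threads(old: dict[int, float], new: dict[int, float]) -> dict[int, float]:
    """Speedup per thread count, for thread counts present in both runs."""
    shared = sorted(old.keys() & new.keys())
    return {threads: speedup_percent(old[threads], new[threads]) for threads in shared}


# Hypothetical visits/s values keyed by thread count (NOT real measurements).
old_run = {8: 1000.0, 16: 1500.0, 32: 1800.0}
new_run = {8: 990.0, 16: 1545.0, 32: 1998.0}

for threads, pct in compare_by_threads(old_run, new_run).items():
    print(f"{threads:>3} threads: {pct:+.1f}%")
```

Comparing per thread count rather than quoting a single figure makes it easier to see whether an upgrade helps at the thread setting you actually use.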
Thank you for your efforts in the development of KataGo.
Also, thank you for letting me use KataGo for free.
The CUDA version of 1.14.0 worked with the following three libraries, which I had been using since before 1.13.0.
The release page says "CUDA 12.1.x and CUDNN 8.9.7 are required", but if it works, does it matter which one I use?
Until now, when using the CUDA version, I also placed the following three files in the same folder in addition to the three above.
If the above three work, are the bottom three unnecessary?
1.14.0-trt did not work with the TensorRT-8.5.2.2 dependency I had been using. I prepared a new TensorRT-8.6.1.6 and it worked.
Compared to 1.13.1-trt, 1.14.0-trt is about 10% faster.