-
Notifications
You must be signed in to change notification settings - Fork 565
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
why the time cost of opencl version is less than cuda version #48
Comments
|
How many searchThreads are you using for each? |
they shared one config, both 1000 searchThreads |
Ummm... that's a pretty absurd number of search threads. I would not expect good play using that many threads, nor do I have any idea how to think about how performance behaves with that many threads.... |
my fault, it was 1 thread. max visits was 1000 |
Well, 1 thread is probably too few. You can get much better performance on both increasing threads (the best number of threads for each one may be different) It's possible that OpenCL will be faster than CUDA, or it's possible that CUDA will be faster than OpenCL, but 1 thread is too few to find out for sure. |
I am testing the 1.2 final version with searchThreads 3( my cpu has 4 threads)
also find opencl version is faster than cuda
searchThreads 4
i also find the mem used by cuda(800M) is about twice of opencl(320M) in windows taskmgr
searchThreads 8
the game will slow down if too many Threads i use a 10x128 model, same result
192x15model , the time is nearly equal
256x20model , the time is nearly equal
|
it seems no speedup when Threads >2, maybe it limited by my cpu? |
cuda version is faster with 20x256 model |
Going ahead and closing this due to lack of new activity, feel free to comment back if you think there's anything to add or reopen. I don't think its too surprising that OpenCL can be faster than CUDA. I might spend some time optimizing one or both a little more before next release. Thanks for posting your stats and results! |
D:\KataGo-1.1\katago12bc = 1.2beta cuda
D:\KataGo12b\katago =1.2beta opencl
The text was updated successfully, but these errors were encountered: