
How to set faster? #28

Closed
ivysrono opened this issue Jun 22, 2019 · 3 comments

Comments

@ivysrono

Is numSearchThreads limited by the CPU or the GPU?
Should nnMaxBatchSize be equal to numSearchThreads?
Anything else?

@lightvector
Owner

numSearchThreads is the number of CPU threads to use. If your GPU is powerful, it can actually be much higher than the number of CPU cores on your system, because you need many threads to feed the GPU batches large enough to get good GPU utilization.

nnMaxBatchSize should be around the number of CPU threads you are using, yes, but how large it needs to be can vary if you are using multiple GPUs instead of one.
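As a rough illustration (these numbers are hypothetical, not recommendations; tune them for your own hardware), the two settings might be paired in the GTP config like this:

```
# Hypothetical example for a single fast GPU.
numSearchThreads = 32   # can exceed physical CPU cores when the GPU is fast
nnMaxBatchSize = 32     # roughly match numSearchThreads with one GPU
```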

You will want to use cudaUseFP16 and cudaUseNHWC if your GPU has FP16 tensor cores.
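For example, on a GPU with FP16 tensor cores the relevant config lines would look something like this (a sketch, assuming the CUDA backend):

```
cudaUseFP16 = true   # half-precision inference on tensor-core GPUs
cudaUseNHWC = true   # NHWC tensor layout, typically faster together with FP16
```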

If you are doing long searches with large numbers of visits, and you don't mind using more RAM on your machine, nnCacheSizePowerOfTwo can be increased, along with nnMutexPoolSizePowerOfTwo. I think the config ships with nnCacheSizePowerOfTwo = 18, meaning it will cache 2^18 = 262144 neural net results, but due to the birthday paradox you may start seeing noticeable cache misses once you search into the high thousands of visits.
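For instance, bumping the cache by a couple of powers of two might look like the following (hypothetical values; each +1 roughly doubles the memory the cache can use):

```
nnCacheSizePowerOfTwo = 20       # 2^20 = 1048576 cached neural net results
nnMutexPoolSizePowerOfTwo = 17   # scale the mutex pool up along with the cache
```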

Thanks for the question, hope that helps. I'll add more documentation about the various parameters to the README, or otherwise within the release itself, before too long; I just haven't done it yet.

@ivysrono
Author

Thank you very much.

@ivysrono
Author

#29
