This is a bug that has been present for a while in tsne-cuda and that we can't seem to track down. It appears to occur when the tree-building kernel exceeds the number of nodes allocated for the Barnes-Hut tree. The way the code currently handles this condition causes an infinite loop.
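To make the failure mode concrete, here is a hypothetical sketch (not tsne-cuda's actual kernel; all names are made up) of how an exhausted fixed-size node pool can become an infinite loop: if the builder's response to a failed allocation is simply to retry, and nothing ever frees a node, the retry can never make progress.

```python
# Hypothetical illustration of the failure pattern, assuming a fixed
# node pool that is allocated up front and never freed mid-build.

MAX_NODES = 8            # fixed pool size chosen for the demo
next_node = 0            # bump-pointer allocator index

def alloc_node():
    """Return the next free node index, or None when the pool is exhausted."""
    global next_node
    if next_node >= MAX_NODES:
        return None
    next_node += 1
    return next_node - 1

def build_step():
    """One insertion step that retries allocation on failure.

    Because no node is ever freed during the build, a failed allocation
    can never succeed on retry -- the loop would spin forever.  The
    attempt cap below exists only so this demo terminates.
    """
    attempts = 0
    while True:
        node = alloc_node()
        if node is not None:
            return node
        attempts += 1
        if attempts > 1000:   # safety cap for the demo, not in real code
            raise RuntimeError("tree build stuck: node pool exhausted")

for _ in range(MAX_NODES):
    build_step()              # the first MAX_NODES insertions succeed
try:
    build_step()              # this one would loop forever without the cap
except RuntimeError as e:
    print(e)
```

The real kernel runs this pattern across many GPU threads, but the core problem is the same: retrying an allocation that can never succeed.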
First of all, this should be impossible: unless two points occupy exactly the same position, at most 2N tree nodes are needed to separate all the data. We've checked, and no two points share exactly the same position.
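The intuition behind that bound can be seen with a toy quadtree (a hypothetical sketch, not tsne-cuda's data structure): a cell subdivides only when two distinct points collide in it, so well-separated points consume far fewer cells than the 2N budget.

```python
# Minimal point-region quadtree: a leaf holds at most one point, and an
# occupied leaf subdivides into four children when a second point arrives.

class Cell:
    def __init__(self, x, y, size):
        self.x, self.y, self.size = x, y, size  # lower-left corner, side length
        self.point = None                       # point stored if leaf
        self.children = None                    # dict of 4 sub-cells once split

def quadrant(cell, p):
    half = cell.size / 2
    return (p[0] >= cell.x + half, p[1] >= cell.y + half)

def child_cell(cell, qx, qy):
    half = cell.size / 2
    return Cell(cell.x + (half if qx else 0), cell.y + (half if qy else 0), half)

def insert(cell, p):
    if cell.children is not None:               # internal node: recurse
        insert(cell.children[quadrant(cell, p)], p)
    elif cell.point is None:                    # empty leaf: store point
        cell.point = p
    else:                                       # occupied leaf: split, reinsert
        old, cell.point = cell.point, None
        cell.children = {(a, b): child_cell(cell, a, b)
                         for a in (False, True) for b in (False, True)}
        insert(cell, old)
        insert(cell, p)

def count_cells(cell):
    if cell.children is None:
        return 1
    return 1 + sum(count_cells(c) for c in cell.children.values())

# Four points, one per quadrant of the unit square: a single split suffices.
points = [(0.25, 0.25), (0.75, 0.25), (0.25, 0.75), (0.75, 0.75)]
root = Cell(0.0, 0.0, 1.0)
for p in points:
    insert(root, p)
print(count_cells(root))  # 5 cells (root + 4 leaves), under the 2N = 8 budget
```

Only coincident (or pathologically close) points force repeated subdivision, which is why the allocation being exceeded despite all points being distinct is so surprising.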
Furthermore, increasing the number of allocated nodes only delays the problem rather than solving it. Printing the number of used nodes shows that, in the iteration before the infinite loop, it is below 2N and well below the increased allocation.
The bug also appears to be data- and learning-rate-dependent. Some combinations of datasets, perplexities, and learning rates trigger it, while others do not. It is at least partially deterministic: the same combination of dataset and input parameters triggers the bug at the same point. However, saving the program state (the current embedded positions, the input data, the learning rate, etc.) and restarting from that point does not reproduce the bug.