[FEA] TSNE PCA Intialization #1029

danielhanchen · 2019-08-19T12:25:29Z

Currently, TSNE's embeddings are intialized from random sampling from a uniform(-0,0001, 0.0001) distribution.

It has been shown that to allow TSNE to be stable during its gradient updates, and to preserve global structure more effectively, utilizing a Randomized SVD or PCA or even a Spectral Embedding as the intial conditions can help.

Currently, cuML has excellent implementations for Truncated SVD and Spectral Embeddings, and so cuML's internal primitives can be used.

Likewise, since TSNE only requires 2 initial components, Halko and Martinsson's 2011 Randomized First Pass SVD can be also investigated, since it has superior speed yet it is accurate.

cjnolet · 2021-04-14T19:52:15Z

I’ve created a more up to date issue for this feature and referenced this issue so that we don’t lose the history trail. Now that we have have a sparse PCA in the Python layer that can accept sparse inputs, I don’t think it should be too hard to port that back to c++, at least for an initial version. If for some reason we need more speed or find issues with numerical stability, we can always try using rsvd or another approach.

danielhanchen added ? - Needs Triage Need team to review and classify feature request New feature or request labels Aug 19, 2019

cjnolet added this to Issue- Needs Prioritizing in v0.10 Release via automation Sep 15, 2019

cjnolet added Algorithm API Change For tracking changes to algorithms that might effect the API CUDA / C++ CUDA issue Cython / Python Cython or Python issue tests Unit testing for project and removed feature request New feature or request labels Sep 15, 2019

JohnZed moved this from Issue- Needs Prioritizing to Issue- P2 in v0.10 Release Sep 23, 2019

JohnZed moved this from Issue- P2 to Defer Post 0.10 in v0.10 Release Sep 26, 2019

cjnolet added this to Issue-Needs prioritizing in v0.11 Release via automation Oct 10, 2019

cjnolet removed this from Defer Post 0.10 in v0.10 Release Oct 10, 2019

cjnolet moved this from Issue-Needs prioritizing to Issue-P1 in v0.11 Release Oct 24, 2019

cjnolet assigned danielhanchen Nov 8, 2019

This was referenced Nov 10, 2019

[WIP] TSNE PCA Clean PR for Branch 11 [skip-ci] #1350

Closed

[REVIEW] TSNE PCA Init + 50% Memory Reductions #1356

Closed

JohnZed moved this from Issue-P1 to Defer to post-0.11 in v0.11 Release Nov 21, 2019

cjnolet added this to Issue-Needs prioritizing in v0.12 Release via automation Nov 21, 2019

cjnolet removed this from Defer to post-0.11 in v0.11 Release Nov 21, 2019

fondaing removed this from Issue-Needs prioritizing in v0.12 Release Mar 5, 2020

fondaing added this to Issue-Needs prioritizing in v0.13 Release via automation Mar 5, 2020

fondaing removed this from Issue-Needs prioritizing in v0.13 Release Apr 3, 2020

fondaing added this to Issue-Needs prioritizing in v0.14 Release via automation Apr 3, 2020

fondaing removed this from Issue-Needs prioritizing in v0.14 Release Jun 9, 2020

fondaing added this to Issue-Needs prioritizing in v0.15 Release via automation Jun 9, 2020

fondaing added this to Issue-Needs prioritizing in v0.16 Release via automation Sep 24, 2020

fondaing removed this from Issue-Needs prioritizing in v0.15 Release Sep 24, 2020

cjnolet added this to Issue-Needs prioritizing in v0.17 Release via automation Oct 23, 2020

cjnolet removed this from Issue-Needs prioritizing in v0.16 Release Oct 23, 2020

cjnolet removed this from Issue-Needs prioritizing in v0.17 Release Oct 27, 2020

wphicks added this to Issue-Needs prioritizing in v21.06 Release via automation Mar 22, 2021

cjnolet mentioned this issue Apr 14, 2021

[FEA] PCA initialization for TSNE #3458

Open

cjnolet removed this from Issue-Needs prioritizing in v21.06 Release Apr 14, 2021

cjnolet closed this as completed Apr 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] TSNE PCA Intialization #1029

[FEA] TSNE PCA Intialization #1029

danielhanchen commented Aug 19, 2019 •

edited

Loading

cjnolet commented Apr 14, 2021

[FEA] TSNE PCA Intialization #1029

[FEA] TSNE PCA Intialization #1029

Comments

danielhanchen commented Aug 19, 2019 • edited Loading

cjnolet commented Apr 14, 2021

danielhanchen commented Aug 19, 2019 •

edited

Loading