Bhtsne for large datasets #81

Open
aaronfderybel opened this issue Jun 26, 2019 · 1 comment

Comments

aaronfderybel commented Jun 26, 2019

Hello @lvdmaaten,

I've read on your t-SNE homepage (https://lvdmaaten.github.io/tsne/) that you can handle datasets with up to 30 million examples. I'm currently working in Google Colab.

My dataset has 2 million examples, each a 100-d vector.
Using verbose=False, I get the following:
image

Using verbose=True, as suggested, I get:
image

I'm not sure what this means or how I should proceed. The example with the MNIST dataset works perfectly using verbose=False.
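
For scale, the raw size of the input described above can be estimated with a few lines of Python. This is only a rough sketch: it assumes the vectors are stored as dense 64-bit floats (the thread does not state the dtype), the helper name `input_size_bytes` is made up for illustration, and it says nothing about the additional memory bhtsne allocates internally.

```python
def input_size_bytes(n_examples: int, n_dims: int, bytes_per_value: int = 8) -> int:
    """Bytes needed for a dense n_examples x n_dims matrix.

    bytes_per_value=8 assumes float64 storage (an assumption, not stated in the thread).
    """
    return n_examples * n_dims * bytes_per_value


# Dataset shape taken from the question: 2 million examples, 100 dimensions each.
size = input_size_bytes(2_000_000, 100)
print(f"{size / 1e9:.1f} GB")  # prints "1.6 GB"
```

Comparing this figure, plus any copies made while preparing the data, with the RAM available in the Colab session is one way to check whether memory is a plausible culprit.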

SamGG commented Jun 26, 2019

Not a solution, but for large datasets you should also look at:
http://www.omiq.ai/opt-SNE
https://www.cytosplore.org
Best.
