-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spots disappear when zoomed out #77
Comments
This is likely a result of the insert order--the three spots are probably clusters that were in the last 50% inserted. At initial zoom, not all 10 million points are shown to reduce network transfer/improve render performance. So if--let's say--these are word2vec embeddings which pop out by default in frequency order, these may be clusters of rarely used words. Etc. To confirm you could run plot.plotAPI({"encoding": {"color": {"field": "ix", "range": "viridis", domain: [1, 10e6]}}}) which will make the color on the chart reflect input order. Easiest solutions are:
|
Also, just out of curiosity, are you able to share what the data is? There aren't that many 10m point t-sne embeddings in the world yet! |
@bmschmidt ah, that makes sense and you were totally right. Thank you so much! Random shuffle fixed the issue. Regarding data - sure this is actually a subset of PubMed. |
@bmschmidt I have a follow up question about this recommendation. |
As currently implemented, The reason is that it's not actually uniform sampling. At insert, every point is assigned an index number from 1 to (in your case) 10 million. At zoom level 1 all points that with an index below 500K will be shown; if you zoom in to show only a quarter of the data all points with an index level below 2m will be shown; if you zoom in to a quarter of that region all points with an index level below 8m will be shown; etc. |
@bmschmidt ah, that makes sense, thank you! |
Thanks for open sourcing, so far works great. Have noticed a small artefact, see:
artefact_vid.mov
Notice 3 spots that disappear when we zoom out and appear when zoomed in. There's roughly 10M points there. Default
quadfeather
flags. Latest deepscatter.The text was updated successfully, but these errors were encountered: