Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cannot scale to the large-scale dataset Reddit #2

Open
xh3204 opened this issue Aug 1, 2022 · 1 comment
Open

cannot scale to the large-scale dataset Reddit #2

xh3204 opened this issue Aug 1, 2022 · 1 comment

Comments

@xh3204
Copy link

xh3204 commented Aug 1, 2022

According to the example code, it needs to compute the distance between arbitrary node pairs, which is O(n^2) complexity and leads to OOM. How can it scale to the large-scale dataset on Reddit?

Thank you~

@pkuwyx
Copy link

pkuwyx commented Aug 1, 2022

Thank you for your attention on our work!
In fact, our method does not need to use the distance of all node pairs in the same time.
Therefore, to solve the out of memory problem when the machine can not afford,we just use the data when needed. Save the result of each node In the disk and read it when needed, or recompute the result when needed may be helpful.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants