Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how can I handle graph with more than 50k nodes? #13

Closed
YunseobShin opened this issue Dec 19, 2018 · 2 comments
Closed

how can I handle graph with more than 50k nodes? #13

YunseobShin opened this issue Dec 19, 2018 · 2 comments

Comments

@YunseobShin
Copy link

Traceback (most recent call last):
File "/home/lib/python3.5/site-packages/joblib/externals/loky/backend/queues.py", line 157, in _feed
send_bytes(obj)
File "/usr/lib/python3.5/multiprocessing/connection.py", line 200, in send_bytes
self._send_bytes(m[offset:offset + size])
File "/usr/lib/python3.5/multiprocessing/connection.py", line 393, in _send_bytes
header = struct.pack("!i", n)
struct.error: 'i' format requires -2147483648 <= number <= 2147483647

Hi. I really appreciate your library. I can get more accurate result with your code even than the author's one.

However, I have a problem that I can't learn the embedding of graphs whose nodes are about more than 50,000.

I guess the joblib module for "parallel_generate_walks" has a limit for large dataset.

Is this code originally limited to be so?

@shuttle1987
Copy link

When this bug is fixed in Python upstream https://bugs.python.org/issue17560 you may find this fixed without having to modify your code. However it may be the case that at such large sizes there are better ways for handling the shared state in the first place.

@eliorc
Copy link
Owner

eliorc commented Jan 15, 2019

In the latest version, it is now possible to use memmapping when using parallel execution using the temp_folder variable on the Node2Vec constructor. Just pass there a location with enough space and it is supposed to work

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants