Suggestion: support corpus_file parameter #18

Closed
ubalklen opened this issue Jul 20, 2020 · 2 comments

Comments

@ubalklen

Hi.

It would be great if nodevectors could support word2vec's corpus_file parameter, which allows fast file-based training.

What do the devs think about that?
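
For readers unfamiliar with the parameter: gensim's Word2Vec accepts a corpus_file path to a plain-text file in LineSentence format, meaning one sentence per line with tokens separated by spaces. For graph embeddings, each "sentence" would be a random walk. The following is a minimal sketch of producing such a file using only the standard library. The helper `write_walks`, the toy graph, and the file name are all hypothetical and are not part of nodevectors.

```python
import random
import tempfile
from pathlib import Path


def write_walks(adjacency, path, walks_per_node=2, walk_length=5, seed=0):
    """Write uniform random walks to `path`: one walk per line,
    node names separated by single spaces (LineSentence layout)."""
    rng = random.Random(seed)
    n_walks = 0
    with open(path, "w", encoding="utf-8") as f:
        for start in adjacency:
            for _ in range(walks_per_node):
                walk = [start]
                # Stop early only if the current node has no neighbours.
                while len(walk) < walk_length and adjacency[walk[-1]]:
                    walk.append(rng.choice(adjacency[walk[-1]]))
                f.write(" ".join(walk) + "\n")
                n_walks += 1
    return n_walks


# Hypothetical toy graph: node name -> list of neighbour names.
toy_graph = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"], "d": ["c"]}
walks_file = Path(tempfile.gettempdir()) / "toy_walks.txt"
n_written = write_walks(toy_graph, walks_file)
```

With gensim installed, the resulting file could then be passed directly as `Word2Vec(corpus_file=str(walks_file))`, so the walks never need to be held in memory as Python lists.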

@VHRanger
Owner

Thanks! I'll keep the issue open and start working on it when I have some time.

@VHRanger
Owner

VHRanger commented Jan 7, 2021

Hi,

The long-term roadmap for this project is to move away from gensim's word2vec implementation.

The plan is to support fast file-based training on huge graphs through memory-mapped graphs rather than through the corpus_file approach.

I'll close the issue for now, though I'm interested in more suggestions like this, especially if we end up keeping the gensim dependency around.
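
To illustrate the general idea of a memory-mapped graph (this is a generic sketch, not the design planned for this project): the graph is stored on disk in CSR form, and the operating system pages in only the parts that are actually read. The file layout, the file name, and the helper `neighbours` below are assumptions made up for this example.

```python
import mmap
import tempfile
from array import array
from pathlib import Path

# Hypothetical toy graph in CSR form: the neighbours of node i are
# indices[indptr[i]:indptr[i + 1]].
indptr = array("q", [0, 2, 4, 7, 8])
indices = array("q", [1, 2, 0, 2, 0, 1, 3, 2])

# Assumed on-disk layout (all 64-bit signed integers):
# [len(indptr)] [indptr ...] [indices ...]
graph_file = Path(tempfile.gettempdir()) / "toy_graph.csr"
with open(graph_file, "wb") as f:
    f.write(array("q", [len(indptr)]).tobytes())
    f.write(indptr.tobytes())
    f.write(indices.tobytes())


def neighbours(view, node):
    """Return the neighbours of `node` from a memory-mapped CSR view."""
    n_ptr = view[0]
    start, stop = view[1 + node], view[2 + node]
    base = 1 + n_ptr  # offset of the first entry of `indices`
    return list(view[base + start:base + stop])


with open(graph_file, "rb") as f, \
        mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
    # Reinterpret the mapped bytes as 64-bit integers without copying.
    with memoryview(mm) as raw, raw.cast("q") as view:
        node2_neighbours = neighbours(view, 2)
```

Only the header, two `indptr` entries, and one slice of `indices` are touched per lookup, which is what lets this access pattern scale to graphs larger than RAM.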

@VHRanger VHRanger closed this as completed Jan 7, 2021