Suggestion: support corpus_file parameter #18

ubalklen · 2020-07-20T20:31:57Z

Hi.

It would be great if nodevectors could support the word2vec's corpus_file parameter that allows for file-based fast training.

What do the devs think about that?

VHRanger · 2020-07-22T13:08:37Z

Thanks! I'll keep the issue open and start working on it when I have some time.

VHRanger · 2021-01-07T21:44:44Z

Hi,

The long term roadmap for this project is to dissociate away from gensim's word2vec implementation.

The plan is to support huge graph file-based fast training through memory-mapped graphs instead of this approach.

I'll close the issue for now, though I'm interested in more suggestions like this, especially if we end up keeping the gensim dependency around

VHRanger closed this as completed Jan 7, 2021

Provide feedback