Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Different vocabularies generated from same data #24

Closed
dukeNashor opened this issue Apr 11, 2019 · 3 comments
Closed

Different vocabularies generated from same data #24

dukeNashor opened this issue Apr 11, 2019 · 3 comments

Comments

@dukeNashor
Copy link

I tested fbow on same data for several times. It is worth noticing that different instances operating on the same data generated different vocabularies, while dbow3 generated the exact same vocabulary. is this normal?

@dukeNashor
Copy link
Author

After some debugging, i noticed that the feature points in initial cluster may (almost always, in my case) duplicate; after changing the initial cluster selection part, everything worked fine.

@RashidLadj
Copy link

Hi @dukeNashor

I can't quite understand the problem, I took a look at your repository but you haven't updated it on git, can you tell me where exactly is the problem?

Thank you.

@dukeNashor
Copy link
Author

Hi @dukeNashor

I can't quite understand the problem, I took a look at your repository but you haven't updated it on git, can you tell me where exactly is the problem?

Thank you.

nvm, the fbow's implementation for choosing the initial cluster centers is different from dbow's, which leads to the fact that different runs of fbow generate different vocabularies.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants