Different vocabularies generated from same data #24

dukeNashor · 2019-04-11T10:26:42Z

I tested fbow on same data for several times. It is worth noticing that different instances operating on the same data generated different vocabularies, while dbow3 generated the exact same vocabulary. is this normal?

dukeNashor · 2019-04-16T09:49:07Z

After some debugging, i noticed that the feature points in initial cluster may (almost always, in my case) duplicate; after changing the initial cluster selection part, everything worked fine.

RashidLadj · 2020-08-12T11:18:13Z

Hi @dukeNashor

I can't quite understand the problem, I took a look at your repository but you haven't updated it on git, can you tell me where exactly is the problem?

Thank you.

dukeNashor · 2020-08-12T11:35:26Z

Hi @dukeNashor

I can't quite understand the problem, I took a look at your repository but you haven't updated it on git, can you tell me where exactly is the problem?

Thank you.

nvm, the fbow's implementation for choosing the initial cluster centers is different from dbow's, which leads to the fact that different runs of fbow generate different vocabularies.

dukeNashor closed this as completed Aug 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Different vocabularies generated from same data #24

Different vocabularies generated from same data #24

dukeNashor commented Apr 11, 2019

dukeNashor commented Apr 16, 2019

RashidLadj commented Aug 12, 2020

dukeNashor commented Aug 12, 2020

Different vocabularies generated from same data #24

Different vocabularies generated from same data #24

Comments

dukeNashor commented Apr 11, 2019

dukeNashor commented Apr 16, 2019

RashidLadj commented Aug 12, 2020

dukeNashor commented Aug 12, 2020