Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vocab.txt & vectors.txt structure #45

Open
RaBa01 opened this issue Jan 12, 2017 · 1 comment
Open

vocab.txt & vectors.txt structure #45

RaBa01 opened this issue Jan 12, 2017 · 1 comment

Comments

@RaBa01
Copy link

RaBa01 commented Jan 12, 2017

hi
i build my model with gensim Word2Vec and i want to learn my model with NER.
is there any document to explain vocab.txt and vectors.txt structure?

i used this script:
bin/dl-ner.py ner.dnn -t train+dev
—vocab vocab.txt —vectors vectors.txt
—caps —suffix —suffixes —gazetteer eng.list
-e 40 -l 0.01 -w 5 -n 300 -v

@kjyong
Copy link

kjyong commented Oct 31, 2017

I tried to analyze the code and converted gensim word2vec model to input for deepnl.
The format I tried is like below, and it works.
[vocab.txt]

word1
word2
word3
...

[vectors.txt]

2 3
word1 1.0 2.0 3.0
word2 2.3 3.5 1.2
...

the first line in vectors.txt means number of words and their dimension.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants