Hello. I have a question on how to make training data. (NER) #4

Open
NuealYoon opened this issue Jul 23, 2020 · 1 comment

Comments

@NuealYoon

I was looking at the train_data.txt file to train the model.

("""It's a visually stunning movie, finding moments both macro and micro to highlight the beautiful imagination that "Star Wars" can evoke.""", {
'words': ['It', "'s", 'a', 'visually', 'stunning', 'movie', ',', 'finding', 'moments', 'both', 'macro', 'and', 'micro', 'to', 'highlight', 'the', 'beautiful', 'imagination', 'that', '"', 'Star', 'Wars', '"', 'can', 'evoke', '.'],
'entities': [(25, 30, 'PRODUCT'), (114, 123, 'WORK_OF_ART')],
'heads': [1, 1, 5, 4, 5, 1, 1, 1, 7, 10, 8, 10, 10, 14, 7, 17, 17, 14, 24, 24, 21, 24, 23, 24, 17, 1],
'deps': ['nsubj', 'ROOT', 'det', 'advmod', 'amod', 'attr', 'punct', 'advcl', 'dobj', 'preconj', 'amod', 'cc', 'conj', 'aux', 'advcl', 'det', 'amod', 'dobj', 'mark', 'punct', 'compound', 'nsubj', 'punct', 'aux', 'relcl', 'punct'],
'tags': ['PRP', 'VBZ', 'DT', 'RB', 'JJ', 'NN', ',', 'VBG', 'NNS', 'CC', 'JJ', 'CC', 'JJ', 'TO', 'VB', 'DT', 'JJ', 'NN', 'IN', '``', 'NNP', 'NNS', "''", 'MD', 'VB', '.'],
'cats': {'POSITIVE': True, 'NEGATIVE': False}
})

  1. What do the numbers in 'entities' mean?

  2. Is there documentation that explains what 'heads', 'deps', 'tags' and 'cats' are?

 Thanks for reading.

@d5555
Owner

d5555 commented Aug 13, 2020

  1. The numbers in 'entities' are character offsets within the sentence: each tuple is (start, end, label).
  2. You will find the answers to the second question in the spaCy documentation, in the section on training.
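
To make the two answers above concrete, here is a minimal sketch that uses only the entry quoted in the question. It slices the sentence with the 'entities' offsets and pairs each word with its 'deps' label and the word its 'heads' value points to. The helper names `entity_spans` and `head_arcs` are made up for illustration and are not part of spaCy or this repository. The sketch assumes the end offset is exclusive (as in Python slicing) and that 'heads' holds absolute token indices, which matches this entry because the ROOT token points to itself. As far as I know, 'tags' are fine-grained part-of-speech tags and 'cats' are text classification labels, but please check the spaCy training documentation to be sure.

```python
# Sentence and annotations copied from the train_data.txt entry quoted above.
text = ("It's a visually stunning movie, finding moments both macro and micro "
        "to highlight the beautiful imagination that \"Star Wars\" can evoke.")
entities = [(25, 30, 'PRODUCT'), (114, 123, 'WORK_OF_ART')]
words = ['It', "'s", 'a', 'visually', 'stunning', 'movie', ',', 'finding',
         'moments', 'both', 'macro', 'and', 'micro', 'to', 'highlight', 'the',
         'beautiful', 'imagination', 'that', '"', 'Star', 'Wars', '"', 'can',
         'evoke', '.']
heads = [1, 1, 5, 4, 5, 1, 1, 1, 7, 10, 8, 10, 10, 14, 7, 17, 17, 14, 24, 24,
         21, 24, 23, 24, 17, 1]
deps = ['nsubj', 'ROOT', 'det', 'advmod', 'amod', 'attr', 'punct', 'advcl',
        'dobj', 'preconj', 'amod', 'cc', 'conj', 'aux', 'advcl', 'det', 'amod',
        'dobj', 'mark', 'punct', 'compound', 'nsubj', 'punct', 'aux', 'relcl',
        'punct']


def entity_spans(text, entities):
    """Return (substring, label) for each (start, end, label) tuple.

    Assumption: start is inclusive and end is exclusive, like a Python slice.
    """
    return [(text[start:end], label) for start, end, label in entities]


def head_arcs(words, heads, deps):
    """Return (word, dependency label, head word) for every token.

    Assumption: heads[i] is the absolute index of token i's syntactic head.
    """
    return [(word, dep, words[head])
            for word, head, dep in zip(words, heads, deps)]


for span, label in entity_spans(text, entities):
    print(f"{label}: {span!r}")

# Show the first few dependency arcs as "word --label--> head word".
for word, dep, head_word in head_arcs(words, heads, deps)[:6]:
    print(f"{word} --{dep}--> {head_word}")
```

The first loop prints `PRODUCT: 'movie'` and `WORK_OF_ART: 'Star Wars'`, so the offsets count characters in the raw sentence, not tokens.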
