What is the meaning of entity column #23

Closed
hanjie0 opened this issue Jul 5, 2019 · 5 comments

hanjie0 commented Jul 5, 2019

In train.csv, a typical entity value looks like this:
[['Q8029103', 139, 143, 0.5], [......]]
Here 'Q8029103' is the identifier of the entity.
What do 139, 143, and 0.5 mean?

Member

zzy14 commented Jul 5, 2019

The second number is the beginning of the entity mention, the third number is the end of the entity mention, and the fourth number is the score of entity linking given by the linking model.
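
A minimal sketch of reading one such entry, assuming the begin and end values are character offsets into the 'text' column with an exclusive end (this thread does not confirm either point). The sample sentence, the identifier 'Q0000001', and the helper name `describe_entity` are made up for illustration.

```python
from typing import List, Tuple, Union

# One entry of the entity column: [entity_id, begin, end, score]
EntityEntry = List[Union[str, int, float]]


def describe_entity(text: str, entry: EntityEntry) -> Tuple[str, str, float]:
    """Return (entity_id, mention, score) for one [id, begin, end, score] entry."""
    entity_id, begin, end, score = entry
    # Assumption: begin/end are character offsets into text, end exclusive.
    mention = text[int(begin):int(end)]
    return str(entity_id), mention, float(score)


text = "Bob Dylan wrote Blowin' in the Wind."
ents = [["Q0000001", 0, 9, 0.5]]  # hypothetical identifier and score

for entry in ents:
    print(describe_entity(text, entry))
```

If the printed mention does not match the entity you expect, the offsets are probably counted differently (for example in tokens rather than characters), and the slice would need to be adjusted.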

zzy14 closed this as completed Jul 5, 2019
Author

hanjie0 commented Jul 5, 2019

Dear sir, may I ask for more details?
"The second number is the beginning of the entity mention"
Do you mean the position of the entity in the sentence, i.e. in the content of the 'text' column?
"the fourth number is the score of entity linking given by the linking model."
Which linking model?
I would like to create a file in the 'train.json' format myself, so I need these details to write my scripts.

Member

zzy14 commented Jul 5, 2019

Please refer to code/example.py for the details of the preparation.

Author

hanjie0 commented Jul 5, 2019

From code/example.py, I think a.begin, a.end, and a.score are returned by tagme.
But are these values (a.begin, a.end, a.score) used later for ERNIE training and testing?
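
For anyone writing their own preparation script, here is a rough sketch of turning linker results into the [id, begin, end, score] layout described above. It is not the code from code/example.py. The `Annotation` class is a stand-in that only mirrors the a.begin, a.end, a.score attributes mentioned in this thread. The `entity_title` field, the `title_to_qid` mapping, and the function name `build_entity_column` are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Union


@dataclass
class Annotation:
    """Stand-in for one linker result (not the real tagme class)."""
    entity_title: str  # assumed field: title of the linked entity
    begin: int         # start of the mention in the text
    end: int           # end of the mention in the text
    score: float       # linking score reported by the linker


def build_entity_column(
    annotations: Iterable[Annotation],
    title_to_qid: Dict[str, str],
) -> List[List[Union[str, int, float]]]:
    """Build [[qid, begin, end, score], ...] from annotation-like objects."""
    entries: List[List[Union[str, int, float]]] = []
    for a in annotations:
        qid = title_to_qid.get(a.entity_title)
        if qid is None:
            # No known identifier for this title, so skip the mention.
            continue
        entries.append([qid, a.begin, a.end, a.score])
    return entries


# Hypothetical data for illustration only.
annotations = [
    Annotation("Bob Dylan", 0, 9, 0.5),
    Annotation("Unknown Thing", 16, 23, 0.1),
]
title_to_qid = {"Bob Dylan": "Q0000001"}

print(build_entity_column(annotations, title_to_qid))
```

Whether ERNIE reads begin, end, and score during training and testing is not answered in this thread, so the sketch only reproduces the file layout.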

Author

hanjie0 commented Jul 7, 2019

Forgive my limited background. To ask again: are a.begin and a.end the location of the entity in the sentence?
And are they used for ERNIE training and testing?
