Skip to content
This repository has been archived by the owner on Feb 13, 2021. It is now read-only.

Question about Chinese entity linking #11

Open
sxyao opened this issue Jun 17, 2017 · 3 comments
Open

Question about Chinese entity linking #11

sxyao opened this issue Jun 17, 2017 · 3 comments

Comments

@sxyao
Copy link

sxyao commented Jun 17, 2017

Is "mvn exec:java -Dexec.mainClass=com.yahoo.semsearch.fastlinking.FastEntityLinker -Dexec.args=“zh/chinese-dec15.hash" the right command to do fastlinking of Chinese?

I run that command and got into the interactive shell. But when I input some sentence, it does not shows the entities. I tried Spanish, and the same thing happened. What could be the problem? Thanks a lot!

@aasish
Copy link
Contributor

aasish commented Jul 5, 2017

@sxyao Can you copy paste your command's output?

@liehe
Copy link

liehe commented Jul 13, 2017

@aasish The problem comes from that fact that Chinese words/phrases are not seperated by space.
For example,

I live in the New York city.
我住在纽约市。

However, if I pass 我住在纽约市 directly to FEL, there is no entity found. If I chunk the sentence by myself

I live in the New York city.
我 住在 纽约市

then FEL will return entities like new york city.

@aasish
Copy link
Contributor

aasish commented Nov 5, 2017

@LiamHe The chinese model expects tokenized text. Please feel free to create a pull request to handle chinese tokenization.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants