Focus keyword is not recognized in Chinese because there are no spaces between words #145

Closed
kendraschaefer opened this Issue Jul 30, 2013 · 17 comments


In the Chinese (and Japanese) languages, there are no spaces between words. So, if a focus keyword is written as part of a sentence in the content, or the middle of a longer title, it is not recognized by the plugin. In order to be recognized, it either has to be on a line by itself, or you have to place a space immediately before and after it, which looks very strange to native Chinese / Japanese readers.

To reproduce, set this as your focus keyword:

我的朋友

(Meaning: My friend)

Then set this as your title / place in content / url:

我的朋友叫小王

(Meaning: My friend is named XiaoWang)
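The reproduction above can be sketched in a few lines. This is only an illustration in Python, not the plugin's actual matching code; the function names are hypothetical. It assumes the matcher wraps the keyword in word boundaries, which is one way to get the behaviour described: the keyword is found on a line by itself or when surrounded by spaces, but not inside a longer run of Chinese characters.

```python
import re


def found_with_word_boundaries(keyword: str, text: str) -> bool:
    """Hypothetical matcher that requires a word boundary on both sides."""
    # \b matches only between a word character and a non-word character.
    # In Python 3, CJK ideographs count as word characters, so there is
    # no boundary between 友 and 叫 in the title below.
    pattern = r"\b" + re.escape(keyword) + r"\b"
    return re.search(pattern, text) is not None


def found_as_substring(keyword: str, text: str) -> bool:
    """Plain substring test, which does not depend on spaces at all."""
    return keyword in text


keyword = "我的朋友"
title = "我的朋友叫小王"

print(found_with_word_boundaries(keyword, title))              # False
print(found_with_word_boundaries(keyword, "我的朋友"))          # True (on a line by itself)
print(found_with_word_boundaries(keyword, "我的朋友 叫小王"))    # True (space after keyword)
print(found_as_substring(keyword, title))                      # True
```

A substring test is the simplest workaround for languages without spaces, though it would over-match in space-delimited languages (for example, finding "art" inside "party").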

Contributor

jrfnl commented Feb 27, 2014

@kendraschaefer Could you please provide me with a number of short sample texts + keywords + expected keyword count for testing purposes?

铅笔的历史非常悠久,它起源于2000多年前的古罗马时期。那时的铅笔很_简陋_,只不过是金属套里夹着的一根铅棒,甚至是铅块,倒真是_名副其实_的“铅”笔。
而我们今天使用的铅笔是用石墨和黏土制成的,里面并不含铅。

Keyword: 铅笔

Expected count: 3
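As a quick check of the expected count, the sample above can be run through a plain substring counter. This is a minimal sketch in Python; `count_keyword` is a hypothetical helper, not part of the plugin.

```python
def count_keyword(keyword: str, text: str) -> int:
    """Count non-overlapping occurrences of keyword, ignoring word boundaries."""
    if not keyword:
        return 0
    return text.count(keyword)


# Sample text copied from the comment above.
sample = (
    "铅笔的历史非常悠久,它起源于2000多年前的古罗马时期。"
    "那时的铅笔很_简陋_,只不过是金属套里夹着的一根铅棒,甚至是铅块,"
    "倒真是_名副其实_的“铅”笔。\n"
    "而我们今天使用的铅笔是用石墨和黏土制成的,里面并不含铅。"
)

print(count_keyword("铅笔", sample))  # 3
```

The quoted form “铅”笔 is not counted, because the closing quotation mark separates the two characters of the keyword.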


Contributor

jrfnl commented Mar 2, 2014

@kendraschaefer Thanks for that. I don't have a solution at the moment, but will keep this in the back of my mind. You are of course correct in saying that the spaces are part of the issue, though there is more to it with ideograph-based languages.

I have the same problem with Polish. I updated to the latest version, 1.5.5, but it still doesn't work: if the keyword contains Polish letters such as ążółńć, the keyword density is not counted.
I hope somebody finally fixes this. I think it has been broken since version 1.4.5-8, so it is a long-standing bug.

Thanks
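The thread does not say why keywords with Polish letters fail. One plausible cause, stated here purely as an assumption, is a word-boundary pattern that treats only ASCII letters as word characters. The Python sketch below imitates that with `re.ASCII`; the function name and the sample sentence are invented for illustration.

```python
import re


def count_whole_word(keyword: str, text: str, ascii_only: bool) -> int:
    """Count whole-word matches; ascii_only imitates a non-Unicode-aware matcher."""
    flags = re.ASCII if ascii_only else 0
    pattern = r"\b" + re.escape(keyword) + r"\b"
    return len(re.findall(pattern, text, flags))


# Invented sample sentence; the keyword uses only letters from the report above.
text = "Kolor żółć jest jasny. Lubię żółć."

# With ASCII-only word characters, ż and ć are "non-word", so \b never
# matches next to them and the keyword is not found at all.
print(count_whole_word("żółć", text, ascii_only=True))   # 0

# With Unicode-aware word characters, both occurrences are found.
print(count_whole_word("żółć", text, ascii_only=False))  # 2
```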

shnssk commented Feb 6, 2015

I have the same problem in Japanese. I hope this will be solved soon. It's a shame that we are missing the chance to use this great feature. It is sad to keep being told that your SEO is poor even when the title and the article contain the keyword.

seripap pushed a commit to seripap/wordpress-seo that referenced this issue Nov 19, 2015

Merge pull request #145 from Yoast/stories/DT/keywordregex-143
Add wordboundaries to regex to make sure matching is correct.

AskKim commented Feb 4, 2016

Please inform the customer of conversation # 99154 when this conversation has been closed.

Contributor

Rarst commented Apr 22, 2016

This is an old issue and there are reports here about different languages over very different plugin versions.

Please let us know if current version of the plugin still exhibits the issue with your language.

@Rarst Rarst closed this May 9, 2016

I still have this issue, anyone working on this?

Contributor

Rarst commented Aug 12, 2016

@paulovsky what language exactly do you have an issue with? Are you using the most current plugin version?

paulovsky commented Aug 14, 2016

@Rarst I'm using WordPress 4.5.3 and Yoast Premium 3.4.2.

I'm managing a bilingual website in both English and Mandarin Chinese. We have no problems with the English SEO. With the Chinese language, however, it seems that Yoast only "sees" the English words. For example, in a fully Chinese text of 2k+ characters, Yoast only sees 50 words, which are just the English expressions or numerals scattered throughout the text.

Is this a core problem/bad config with WordPress, related to the fact that it is a bilingual website?

Any advice would be gold. Thanks!
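The word count described above is what a counter produces if it only recognizes runs of Latin letters and digits. The Python sketch below contrasts that with a counter that also treats each Chinese character as one unit. Both functions and the sample sentence are hypothetical, and neither is the plugin's real algorithm; the character range covers only the basic CJK Unified Ideographs block, so Japanese kana and rarer ideographs are deliberately left out.

```python
import re


def count_latin_tokens(text: str) -> int:
    """Count runs of ASCII letters/digits, as a Latin-only counter would."""
    return len(re.findall(r"[A-Za-z0-9]+", text))


def count_cjk_aware(text: str) -> int:
    """Count Latin/digit runs plus each CJK ideograph as one unit."""
    return len(re.findall(r"[A-Za-z0-9]+|[\u4e00-\u9fff]", text))


# Invented sample: Chinese text with a year and two English names mixed in.
sample_text = "我们在2016年使用WordPress和Yoast管理双语网站"

print(count_latin_tokens(sample_text))  # 3  (2016, WordPress, Yoast)
print(count_cjk_aware(sample_text))     # 16 (3 Latin tokens + 13 ideographs)
```

Counting one unit per character is only a rough proxy for Chinese word count; proper word segmentation would need a dictionary or a dedicated segmenter.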

Yup, same problem. Would be awesome to fix this.

On Sun, Aug 14, 2016 at 12:08 AM, paulovsky notifications@github.com
wrote:

@Rarst https://github.com/Rarst I'm using WordPress 4.5.3 and Yoast
Premium 3.4.2.

I'm managing a bilingual website in both English and mandarin Chinese. We
have no problems with the English SEO, however, with the Chinese language,
it seems that Yoast only "sees" the English words. For example, in a full
Chinese text with 2k+ characters, Yoast only see 50 words, which are just
English expressions or Arabic numerals scattered throughout the text.

Is this a core problem/bad config with WordPress, related to the fact that
it is a bilingual website, or that the admin area is in English?

Any advice would be gold. Thanks!



@Rarst can you please reopen this issue? I'm afraid it's not solved.

@Rarst Rarst reopened this Aug 15, 2016

@Rarst Rarst added bug and removed wait for feedback labels Aug 15, 2016

ullivr commented Aug 21, 2016

Same problem. Yoast is useless to me because of this; I write my blog in Chinese.

Please inform the customer of conversation # 148759 when this conversation has been closed.

Contributor

terw-dan commented Dec 6, 2016

Support for Chinese/Japanese and other syllabic languages should be built into YoastSEO.js.
This will be picked up in Yoast/YoastSEO.js#761.

Other languages like Polish are supported at this moment.

@terw-dan terw-dan closed this Dec 6, 2016

joe528 commented Dec 30, 2016

The problem still exists.

Supporting Chinese/Japanese doesn't mean you can show Chinese/Japanese texts in your plugin UI.

The problem stated as the title "Focus keyword is not recognized in Chinese because there are no spaces between words" still exists.

For example, a title like this:

"學英文的三大動力"

And the focus keyword is

"學英文" (as you can see, exactly the first three Chinese characters in the title)

Yoast is still saying "The focus keyword '學英文' does not appear in the SEO title." as shown in the screenshot here:

screen shot 2016-12-30 at 11 53 35 am

The article has over 300 words, and Yoast still complains "The text contains 54 words. This is far below the recommended minimum of 300 words. Add more content that is relevant for the topic."

These problems still exist in the latest version of Yoast (version 4.0.2) and have never been fixed (I have been using Yoast for over 2 years and have never seen them fixed).

Please reopen this bug and help.

Contributor

terw-dan commented Dec 30, 2016

This issue was closed because it is something that will be addressed here: Yoast/YoastSEO.js#761

To avoid duplication of issues, this is closed. But that doesn't mean it is not something we want to implement in upcoming releases.
