Skip to content
World of Warcraft Chinese Dictionary
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
LICENSE
README.md
dict.inc

README.md

World of Warcraft Chinese Dictionary

This dictionary is discovered by word extraction algorithm from all zh_cn locale strings exported from database client. This is the raw output, not revised by human, so the precision and recall rate are without any warranty.

The dict is encoded as UTF-8(with BOM), tab-delimited. Line ends with single LF(Unix style).

The first column is the word, the second column is the frequency, the third column is the coherency, the last column is the entropy.

Frequency indicates how many times this word appeared.

Coherency indicates how close are characters coupled in this word. E.g. “玫瑰” will get a high coherency because each time “玫” appears, it's always followed by “瑰”.

Entropy indicates the freedom of this word's usage, how rich are the contexts may be around the word. E.g. “费伍德” will get a low entropy because it could only be used in few contexts, mainly in “费伍德森林”.

Extractor could be found at aleph repository.

You can’t perform that action at this time.