Whether it works in Chinese Word Segmentation #17

Closed
hzylmf opened this issue Nov 2, 2017 · 1 comment

hzylmf commented Nov 2, 2017

Thanks for your code. I want to use it for Chinese Word Segmentation, so will it work if I apply the code to my word segmentation task?

@LiyuanLucasLiu (Owner)

Thanks :-)

The current code does not work for Chinese out of the box, but you can definitely modify it a little to make it work on Chinese.

Basically, I would recommend using our word-level model, treating each Chinese character as a word, and modifying the encoding for reading & writing. Also, pre-trained embeddings are crucial for performance, so I would also recommend getting character-level embeddings for Chinese (there are several papers about this).
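To illustrate the "treat each character as a word" idea, here is a minimal sketch (not part of this repo) that converts whitespace-segmented Chinese text into a character-per-line, CoNLL-style "token tag" format using the common BMES segmentation tags. The function name and the exact column layout are assumptions; adjust them to whatever input format the training script expects.

```python
def to_bmes(segmented_sentence):
    """Map a whitespace-segmented sentence to (character, tag) pairs.

    B = begin of a multi-character word, M = middle, E = end,
    S = single-character word.
    """
    pairs = []
    for word in segmented_sentence.split():
        if len(word) == 1:
            pairs.append((word, "S"))
        else:
            pairs.append((word[0], "B"))
            pairs.extend((ch, "M") for ch in word[1:-1])
            pairs.append((word[-1], "E"))
    return pairs


if __name__ == "__main__":
    # Example: "我 喜欢 自然语言处理" ("I like natural language processing")
    for ch, tag in to_bmes("我 喜欢 自然语言处理"):
        print(f"{ch} {tag}")  # one character per line, treated as a "word" token
    print()                   # blank line separates sentences
```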

Besides, you could also try representing Chinese characters by Wubi or Pinyin, and treating those as the character-level representation.
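For the Pinyin route, one option (an assumption, not something this repo depends on) is the third-party pypinyin package, which expands each character into its romanized spelling; the letters of that spelling can then play the role of the character-level view of each token. A minimal sketch:

```python
# Requires: pip install pypinyin  (third-party library, not bundled with this repo)
from pypinyin import lazy_pinyin

sentence = "自然语言处理"
# lazy_pinyin returns one toneless romanization per character,
# e.g. ['zi', 'ran', 'yu', 'yan', 'chu', 'li']
for ch, py in zip(sentence, lazy_pinyin(sentence)):
    print(ch, py)  # the letters of `py` can serve as the character-level input for `ch`
```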
