Skip to content

Ubuntu系统下,利用wiki中文语料训练word2vec模型

Notifications You must be signed in to change notification settings

Jacen789/zhwiki_word2vec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

zhwiki_word2vec

Ubuntu系统下,利用wiki中文语料训练word2vec模型

开发环境配置

  • sudo aptitude install opencc (之所以不用opencc-python是因为太慢了)
  • 安装anaconda3
  • pip install -U gensim
  • pip install -U cjieba

语料下载

实现步骤

  • 先提取语料
  • 再进行繁体转简体
  • 再进行分词
  • 然后训练word2vec模型
  • 最后测试模型效果

About

Ubuntu系统下,利用wiki中文语料训练word2vec模型

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published