mynlp

An NLP scaffold with common NLP operations built in.

preprocess

english

  1. String normalization
  2. Sentence splitting
  3. Tokenization
  4. Stemming
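
The four steps above could be chained roughly as follows. This is a minimal standard-library sketch, not mynlp's actual API: every function name here is hypothetical, the sentence splitter is naive (it breaks on abbreviations such as "e.g."), and `stem` is a crude suffix stripper standing in for a real stemmer such as Porter.

```python
import re
import unicodedata


def normalize(text):
    """NFKC-normalize, lowercase, and collapse runs of whitespace."""
    text = unicodedata.normalize("NFKC", text).lower()
    return re.sub(r"\s+", " ", text).strip()


def split_sentences(text):
    """Split after '.', '!' or '?' when followed by whitespace (naive)."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text) if s]


def tokenize(sentence):
    """Extract runs of lowercase letters, digits, and apostrophes."""
    return re.findall(r"[a-z0-9']+", sentence)


def stem(token):
    """Crude suffix stripping; an assumption, not a real stemming algorithm."""
    for suffix in ("ing", "ed", "es", "s"):
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess_english(text):
    """Return one list of stemmed tokens per sentence."""
    return [
        [stem(tok) for tok in tokenize(sentence)]
        for sentence in split_sentences(normalize(text))
    ]


if __name__ == "__main__":
    print(preprocess_english("The cats   jumped. Dogs are barking!"))
```

Returning one token list per sentence keeps sentence boundaries available to later feature steps that expect sentence-level input.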

chinese

  1. Sentence splitting
  2. Word segmentation
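
For Chinese, both steps can be sketched with the standard library alone. The README does not say which segmenter mynlp uses, so the example below swaps in forward maximum matching against a toy vocabulary purely for illustration; the function names, the vocabulary, and the `max_len` parameter are all assumptions.

```python
import re


def split_sentences_zh(text):
    """Split after Chinese sentence-ending punctuation, keeping the punctuation."""
    return [s for s in re.split(r"(?<=[。！？])", text) if s.strip()]


def segment_fmm(sentence, vocab, max_len=4):
    """Forward maximum matching: at each position take the longest vocab entry.

    Falls back to a single character when nothing in the vocabulary matches.
    """
    tokens = []
    i = 0
    while i < len(sentence):
        for size in range(min(max_len, len(sentence) - i), 0, -1):
            piece = sentence[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


if __name__ == "__main__":
    # Toy vocabulary, hypothetical; a real segmenter needs a large dictionary.
    vocab = {"今天", "天气", "很好", "我们", "公园"}
    for sentence in split_sentences_zh("今天天气很好。我们去公园吧！"):
        print(segment_fmm(sentence, vocab))
```

Greedy matching is simple but picks the wrong split on ambiguous strings, which is why dictionary-plus-statistics segmenters are usually preferred in practice.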

feature

english

  1. tfidf
  2. word2vec
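
As an illustration of the tfidf feature, here is a minimal sketch over pre-tokenized documents. It assumes the plain unsmoothed variant (term frequency as count divided by document length, inverse document frequency as log(N / df)); the README does not state which variant mynlp implements, and the `tfidf` function name is hypothetical.

```python
import math
from collections import Counter


def tfidf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


if __name__ == "__main__":
    docs = [["cat", "sat"], ["cat", "ran"]]
    for row in tfidf(docs):
        print(row)
```

With this variant a term that occurs in every document gets weight zero, so only distinguishing terms contribute.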

chinese

  1. tfidf
  2. word2vec
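
The word2vec feature needs a trained model, which is beyond a short example. What can be shown is the data preparation step: the skip-gram flavor of word2vec trains on (center, context) word pairs drawn from a sliding window over segmented text. The sketch below only generates those pairs and does not train embeddings; `skipgram_pairs` and its `window` parameter are hypothetical names, and whether mynlp uses skip-gram or CBOW is not stated in the README.

```python
def skipgram_pairs(tokens, window=2):
    """Return (center, context) pairs for every token within `window` positions."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


if __name__ == "__main__":
    # Input is already segmented, e.g. by the Chinese preprocessing steps.
    print(skipgram_pairs(["今天", "天气", "很好"], window=1))
```

The same function applies unchanged to English token lists, since it only depends on token order.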