Skip to content
Vietnamese language model for spacy.io
Branch: master
Clone or download
Latest commit a32de27 Jul 10, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
language_data init Jul 10, 2019
packages/vi_spacy_model-0.2.0 init Jul 10, 2019
vi/vi init Jul 10, 2019
README.md init Jul 10, 2019
prepare_language_model.ipynb init Jul 10, 2019
train_word_vectors.py init Jul 10, 2019
vi-vocab-data.jsonl init Jul 10, 2019

README.md

vivi_spacy (UPDATED, WORK for SPACY Verson > 2.1!!!)

Vivi_spacy contains Vietnamese models for spaCy. We trained word2vec on a combination of wikipedia and news corpus, vector size = 128. Pos tagger and DEP parser are trained on UD Vietnamese (http://universaldependencies.org/treebanks/vi/index.html)

Installation

  1. Download vivi model directly using pip:
pip install https://github.com/trungtv/vivi_spacy/raw/master/packages/vi_spacy_model-0.2.0/dist/vi_spacy_model-0.2.0.tar.gz
  1. You may need to install pyvi
    pip install pyvi 

Usage: import as module

import spacy
nlp = spacy.load('vi_spacy_model')
doc = nlp('Cộng đồng xử lý ngôn ngữ tự nhiên'))
You can’t perform that action at this time.