Skip to content

djbpitt/russian-nlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

russian-nlp

About

Playground for Russian NLP packages.

Tested under Python 3.9.9 Anaconda miniforge Apple arm (M1)

MyStem

Install https://pypi.org/project/pymystem3/. Installing the latest version, as described on that page, errors out, but the current stable version works as advertised.

Notes:

  1. It is not necessary to install MyStem separately; the wrapper package fetches it. If desired, though, a command-line executable for MyStem can be downloading from https://download.cdn.yandex.net/mystem/mystem-3.1-macosx.tar.gz. Untar with tar -xvf mystem-3.1-macosx.tar.gz, which unpacks an executable called mystem. Remove quarantine with xattr -d com.apple.quarantine mystem and run from the command line.
  2. Downloading the command-line executable failed on 2022-03-12 from the link on the main MyStem page (https://yandex.ru/dev/mystem/), but worked from the direct link above.
  3. The executable is for Apple Intel, but works under M1.

spaCy

There are three Russian models (small, medium, large), all trained on news data. Install spaCy (https://spacy.io/) and then the Russian models (https://spacy.io/models/ru).

See also spaCy + Stanza (formerly StanfordNLP) https://github.com/explosion/spacy-stanza, a wrapper for using Stanford models (https://stanfordnlp.github.io/stanfordnlp/) from inside spaCy.

A custom tokenizer for adjusting errata in spaCy's tokenization of Russian: https://github.com/aatimofeev/spacy_russian_tokenizer

Slovnet

Slovnet (https://github.com/natasha/slovnet) is part of the Natasha project (https://github.com/natasha).

DeepPavlov

DeepPavlov (https://github.com/deepmipt/DeepPavlov) is designed for development of production ready chat-bots and complex conversational systems, research in the area of NLP and, particularly, of dialog systems.


Links to other resources

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published