Skip to content

veridic/docker-syntaxnet_rus

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

53 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Syntaxnet for Russian

Google's SyntaxNet Parser and POS tagger with a model for Russian language.

Usage


1. Single parse using shell:

echo "мама мыла раму" | docker run --rm -i inemo/syntaxnet_rus
...
Input: Name this boat
Parse (CONLL format):
1       мама    _       NOUN    _       Animacy=Anim|Case=Nom|Gender=Fem|Number=Sing|fPOS=NOUN++        2       nsubj   _       _
2       мыла    _       VERB    _       Aspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act|fPOS=VERB++        0  ROOT     _       _
3       раму    _       NOUN    _       Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|fPOS=NOUN++        2       dobj    _       _

2. Standalone SyntaxNet server that does not recreate models (stays alive) (unstable):

docker run --shm-size=1024m -ti --rm -p 8111:9999 inemo/syntaxnet_eng server 0.0.0.0 9999

Note that, although the current container installs model for Russian, the implemented server can be used for any language (any model trained in SyntaxNet).

2.1 You also can use the server in conjunction with SyntaxNet python wrapper.

2.2 You can use telnet to talk with parser (be aware about escape problems of unicode in telnet, e.g., 'маму' will not work by default via telnet):

telnet localhost 8111
мама мыла
1       мама    _       NOUN    _       Animacy=Anim|Case=Nom|Gender=Fem|Number=Sing|fPOS=NOUN++        2       nsubj   _       _
2       мыла    _       VERB    _       Aspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act|fPOS=VERB++        0  ROOT     _       _

Updating


cd <project_path>/docker/
./build.sh
#docker login
#docker build -t inemo/syntaxnet_rus --no-cache . && docker push inemo/syntaxnet_rus

About

Dockerized version of Google's SyntaxNet Parser and POS tagger for Russian + standalone server.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Dockerfile 84.7%
  • Shell 15.3%