Skip to content

Releases: explosion/spacy-models

zh_core_web_trf-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: cb9641d03c5b357ad3ed538af36c3b48d3a90af2c7218b8504e859fd8fb5843d
Checksum .whl: b1e06addd5ba03811d2b8a6cf929541fb7495cb6dba985fd6f4ce0f9b9b51495

Details: https://spacy.io/models/zh#zh_core_web_trf

Chinese transformer pipeline (bert-base-chinese). Components: transformer, tagger, parser, ner, attribute_ruler.

Feature Description
Name zh_core_web_trf
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline transformer, tagger, parser, attribute_ruler, ner
Components transformer, tagger, parser, attribute_ruler, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
bert-base-chinese (Hugging Face)
License MIT
Author Explosion
Model size 398 MB

Label Scheme

View label scheme (99 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 92.48
SENTS_P 71.45
SENTS_R 65.44
SENTS_F 68.31
DEP_UAS 76.85
DEP_LAS 72.99
ENTS_P 75.20
ENTS_R 76.55
ENTS_F 75.87

Installation

pip install spacy
python -m spacy download zh_core_web_trf

zh_core_web_sm-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: b44ee21b09dbb37862b9c9152c02a39d0541561ea5d23b81dfa6641f97956a58
Checksum .whl: 50ec664cef64a095dd9ec7b9a1dba76814db10d72e4024249f9105ace8cb5ffa

Details: https://spacy.io/models/zh#zh_core_web_sm

Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.

Feature Description
Name zh_core_web_sm
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
License MIT
Author Explosion
Model size 46 MB

Label Scheme

View label scheme (100 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X, _SP
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 89.33
SENTS_P 77.85
SENTS_R 72.62
SENTS_F 75.14
DEP_UAS 69.60
DEP_LAS 64.08
ENTS_P 72.03
ENTS_R 64.93
ENTS_F 68.30

Installation

pip install spacy
python -m spacy download zh_core_web_sm

zh_core_web_md-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 6ae3d5a56229e3b5fb287ca1f8fa23451a55e52897c59f5b0814d9be5bd71d0f
Checksum .whl: 9daae015e8a5be9e4914cd9ac722d5f9a0d1d41b6f19c5b1c961cc576582103a

Details: https://spacy.io/models/zh#zh_core_web_md

Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.

Feature Description
Name zh_core_web_md
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, ner
Vectors 500000 keys, 20000 unique vectors (300 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License MIT
Author Explosion
Model size 74 MB

Label Scheme

View label scheme (100 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X, _SP
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 90.04
SENTS_P 78.89
SENTS_R 72.80
SENTS_F 75.72
DEP_UAS 70.50
DEP_LAS 65.22
ENTS_P 71.88
ENTS_R 67.90
ENTS_F 69.83

Installation

pip install spacy
python -m spacy download zh_core_web_md

zh_core_web_lg-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: e2373a8505fbde3f178fc1e0fd0a3ccb3f36b8d91f37962f19d7d3e26b79e79f
Checksum .whl: 940193b60c501b8114723bda29fa688ad409493c86384de0ed70a471cf526c18

Details: https://spacy.io/models/zh#zh_core_web_lg

Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.

Feature Description
Name zh_core_web_lg
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, ner
Vectors 500000 keys, 500000 unique vectors (300 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License MIT
Author Explosion
Model size 575 MB

Label Scheme

View label scheme (100 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X, _SP
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 90.33
SENTS_P 78.05
SENTS_R 72.63
SENTS_F 75.24
DEP_UAS 70.86
DEP_LAS 65.71
ENTS_P 73.55
ENTS_R 69.25
ENTS_F 71.34

Installation

pip install spacy
python -m spacy download zh_core_web_lg

xx_sent_ud_sm-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: ff24a61397363dfa7c67006298c4d0d4ac6c5e49900d68da41e353aa67a31822
Checksum .whl: bbc9c66216e1232e144c33bf525afab525ff0e276850fd0c75762f73a7edb6b8

Details: https://spacy.io/models/xx#xx_sent_ud_sm

Multi-language pipeline optimized for CPU. Components: senter.

Feature Description
Name xx_sent_ud_sm
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline senter
Components senter
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources Universal Dependencies v2.8 (UD_Afrikaans-AfriBooms, UD_Croatian-SET, UD_Czech-CAC, UD_Czech-CLTT, UD_Danish-DDT, UD_Dutch-Alpino, UD_Dutch-LassySmall, UD_English-EWT, UD_Finnish-FTB, UD_Finnish-TDT, UD_French-GSD, UD_French-Spoken, UD_German-GSD, UD_Indonesian-GSD, UD_Irish-IDT, UD_Italian-TWITTIRO, UD_Korean-GSD, UD_Korean-Kaist, UD_Latvian-LVTB, UD_Lithuanian-ALKSNIS, UD_Lithuanian-HSE, UD_Marathi-UFAL, UD_Norwegian-Bokmaal, UD_Norwegian-Nynorsk, UD_Norwegian-NynorskLIA, UD_Persian-Seraji, UD_Portuguese-Bosque, UD_Portuguese-GSD, UD_Romanian-Nonstandard, UD_Romanian-RRT, UD_Russian-GSD, UD_Russian-Taiga, UD_Serbian-SET, UD_Slovak-SNK, UD_Spanish-GSD, UD_Swedish-Talbanken, UD_Telugu-MTG, UD_Vietnamese-VTB) (Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell; et al.)
License CC BY-SA 3.0
Author Explosion
Model size 4 MB

Label Scheme

Accuracy

Type Score
TOKEN_ACC 98.59
TOKEN_P 95.31
TOKEN_R 95.72
TOKEN_F 95.52
SENTS_P 90.66
SENTS_R 81.58
SENTS_F 85.88

Installation

pip install spacy
python -m spacy download xx_sent_ud_sm

xx_ent_wiki_sm-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 7d6c44535829c047b52b2dfabc720cd4f58f460df3516357b1e5aeefdfd74a10
Checksum .whl: bf6fdee3e36c36d636f56cf68aa70c02eea43cfa779f4038bab8ce12acaa0715

Details: https://spacy.io/models/xx#xx_ent_wiki_sm

Multi-language pipeline optimized for CPU. Components: ner.

Feature Description
Name xx_ent_wiki_sm
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline ner
Components ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources WikiNER (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)
License MIT
Author Explosion
Model size 10 MB

Label Scheme

View label scheme (4 labels for 1 components)
Component Labels
ner LOC, MISC, ORG, PER

Accuracy

Type Score
ENTS_P 83.53
ENTS_R 82.65
ENTS_F 83.08

Installation

pip install spacy
python -m spacy download xx_ent_wiki_sm

uk_core_news_trf-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: cb21c1384506dba79be33c0c39fd83e66d9ed24121d03d5acf2e52b523db7009
Checksum .whl: 17e27632d4ba8d90923fede131de3b527f7e2e916021bf1447b56eb9a61db2c9

Details: https://spacy.io/models/uk#uk_core_news_trf

Ukrainian transformer pipeline (ukr-models/xlm-roberta-base-uk). Components: transformer, morphologizer, parser, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_trf
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline transformer, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components transformer, morphologizer, parser, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
ukr-models/xlm-roberta-base-uk (Volodymyr Kurnosov and Yinhan Liu and Myle Ott and Naman Goyal and Jingfei Du and Mandar Joshi and Danqi Chen and Omer Levy and Mike Lewis and Luke Zettlemoyer and Veselin Stoyanov)
License MIT
Author Explosion
Model size 394 MB

Label Scheme

View label scheme (1210 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, Animacy=Anim|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Loc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, `Animacy=Anim|Case=G...

uk_core_news_sm-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: df3e5cec8bcd0031079d7a9c6bdb824e2793e2939654c24ed6db38e897ac6bb6
Checksum .whl: a93689b18910b8938f34a8b90b40ba1ef0c503053a6ae60e21120f2781e1ba32

Details: https://spacy.io/models/uk#uk_core_news_sm

Ukrainian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_sm
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
License MIT
Author Explosion
Model size 14 MB

Label Scheme

View label scheme (1211 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, POS=SPACE, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, Animacy=Anim|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Loc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Anim|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Case=Gen|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Case=Nom|NumType=Card|POS=NUM|Uninflect=Yes, Case=Nom|Degree=Pos|Gender=Neut|Number=Sing|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=NOUN, `Animacy=Inan|Case=Acc...

uk_core_news_md-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 57043482c57ebe8336c0930b387412f4aa013573b09fec83a97016955338af16
Checksum .whl: ea6d6fa820a7614ffcb2015d9dfd8d19e407ecbb35dfadd95648b026bb346ae7

Details: https://spacy.io/models/uk#uk_core_news_md

Ukrainian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_md
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors floret (50000, 300)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl) (Explosion)
License MIT
Author Explosion
Model size 65 MB

Label Scheme

View label scheme (1211 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, POS=SPACE, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, Animacy=Anim|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Loc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Anim|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Case=Gen|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Case=Nom|NumType=Card|POS=NUM|Uninflect=Yes, `Case=Nom|Degree=...

uk_core_news_lg-3.5.0

19 Jan 12:35
b40a197
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: e24e7130b573bf52cd6a3dd7237d4918f93b3218723eb7b9619351ddadec8942
Checksum .whl: f607e204e0300a0a53594a66454ae37c11e32e2fe402b63f528237b2419522b6

Details: https://spacy.io/models/uk#uk_core_news_lg

Ukrainian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_lg
Version 3.5.0
spaCy >=3.5.0,<3.6.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors floret (200000, 300)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl) (Explosion)
License MIT
Author Explosion
Model size 220 MB

Label Scheme

View label scheme (1211 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, POS=SPACE, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, Animacy=Anim|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Loc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Anim|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Case=Gen|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Case=Nom|NumType=Card|POS=NUM|Uninflect=Yes, `Case=Nom|Degre...