Update lex_attrs.py #5608

myavrum · 2020-06-19T05:30:16Z

Description

Some typographical corrections have been made in

Types of change

Checklist

I have submitted the spaCy Contributor Agreement.
I ran the tests, and all new and existing tests passed.
My changes don't require a change to the documentation, or if they do, I've added all required information.

explosion-bot · 2020-06-19T05:30:57Z

Hi @myavrum, thanks for your pull request! 👍 It looks like you haven't filled in the spaCy Contributor Agreement (SCA) yet. The agreement ensures that we can use your contribution across the project. Once you've filled in the template, put it in the .github/contributors directory and add it to this pull request. If your pull request targets a branch that's not master, for example develop, make sure to submit the Contributor Agreement to the master branch. Thanks a lot!

If you've already included the Contributor Agreement in your pull request above, you can ignore this message.

svlandeg · 2020-06-19T16:34:51Z

Contributor agreement merged in PR #5612

svlandeg

Thanks a lot for your contribution, we really appreciate it!

spacy/lang/hy/lex_attrs.py

svlandeg · 2020-06-19T18:00:27Z

I'll just go ahead and merge this, so @mahnerak can follow up with a few additional commits in a new PR, without creating a conflict :-)

mahnerak · 2020-06-19T18:36:49Z

@svlandeg
According to the changes in this PR:
By code hy we denote (eastern) Armenian with modern orthography. (hyw, xcl, arm for the other variations). Also, the tag_map obtained from UniversalDependencies-Armenian-TDP corresponds only to the modern eastern Armenian orthography.

So yes, these are only correct versions for hy.

* Fix typos and auto-format [ci skip] * Add pkuseg warnings and auto-format [ci skip] * Update Binder URL [ci skip] * Update Binder version [ci skip] * Update alignment example for new gold.align * Update POS in tagging example * Fix numpy.zeros() dtype for Doc.from_array * Change example title to Dr. Change example title to Dr. so the current model does exclude the title in the initial example. * Fix spacy convert argument * Warning for sudachipy 0.4.5 (#5611) * Create myavrum.md (#5612) * Update lex_attrs.py (#5608) * Create mahnerak.md (#5615) * Some changes for Armenian (#5616) * Fixing numericals * We need a Armenian question sign to make the sentence a question * Add Nepali Language (#5622) * added support for nepali lang * added examples and test files * added spacy contributor agreement * Japanese model: add user_dict entries and small refactor (#5573) * user_dict fields: adding inflections, reading_forms, sub_tokens deleting: unidic_tags improve code readability around the token alignment procedure * add test cases, replace fugashi with sudachipy in conftest * move bunsetu.py to spaCy Universe as a pipeline component BunsetuRecognizer * tag is space -> both surface and tag are spaces * consider len(text)==0 * Add warnings example in v2.3 migration guide (#5627) * contribute (#5632) * Fix polarity of Token.is_oov and Lexeme.is_oov (#5634) Fix `Token.is_oov` and `Lexeme.is_oov` so they return `True` when the lexeme does **not** have a vector. * Extend what's new in v2.3 with vocab / is_oov (#5635) * Skip vocab in component config overrides (#5624) * Fix backslashes in warnings config diff (#5640) Fix backslashes in warnings config diff in v2.3 migration section. * Disregard special tag _SP in check for new tag map (#5641) * Skip special tag _SP in check for new tag map In `Tagger.begin_training()` check for new tags aside from `_SP` in the new tag map initialized from the provided gold tuples when determining whether to reinitialize the morphology with the new tag map. * Simplify _SP check Co-authored-by: Ines Montani <ines@ines.io> Co-authored-by: Marat M. Yavrumyan <myavrum@ysu.am> Co-authored-by: Karen Hambardzumyan <mahnerak@gmail.com> Co-authored-by: Rameshh <30867740+rameshhpathak@users.noreply.github.com> Co-authored-by: Hiroshi Matsuda <40782025+hiroshi-matsuda-rit@users.noreply.github.com> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Fix on_match callback and remove empty patterns (#6312) For the `DependencyMatcher`: * Fix on_match callback so that it is called once per matched pattern * Fix results so that patterns with empty match lists are not returned * Add --prefer-binary for python 3.5 * Add version pins for pyrsistent * Use backwards-compatible super() * Try to fix tests on Travis (2.7) * Fix naming conflict and formatting * Update pkuseg version in Chinese tokenizer warnings * Some changes for Armenian (#5616) * Fixing numericals * We need a Armenian question sign to make the sentence a question * Update lex_attrs.py (#5608) * Fix compat * Update Armenian from v2.3.x Co-authored-by: Ines Montani <ines@ines.io> Co-authored-by: Karen Hambardzumyan <mahnerak@gmail.com> Co-authored-by: Marat M. Yavrumyan <myavrum@ysu.am>

Update lex_attrs.py

7b878b6

svlandeg added enhancement Feature requests and improvements lang / hy Armenian language data and models labels Jun 19, 2020

svlandeg reviewed Jun 19, 2020

View reviewed changes

svlandeg mentioned this pull request Jun 19, 2020

Armenian language support #5246

Merged

3 tasks

svlandeg reviewed Jun 19, 2020

View reviewed changes

spacy/lang/hy/lex_attrs.py Show resolved Hide resolved

svlandeg merged commit 8120b64 into explosion:master Jun 19, 2020

mahnerak pushed a commit to YerevaNN/spaCy that referenced this pull request Jun 19, 2020

Update lex_attrs.py (explosion#5608)

7c9536f

adrianeboyd pushed a commit that referenced this pull request Nov 12, 2020

Update lex_attrs.py (#5608)

24f061a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update lex_attrs.py #5608

Update lex_attrs.py #5608

myavrum commented Jun 19, 2020 •

edited

explosion-bot commented Jun 19, 2020

svlandeg commented Jun 19, 2020

svlandeg left a comment •

edited

svlandeg commented Jun 19, 2020

mahnerak commented Jun 19, 2020

Update lex_attrs.py #5608

Update lex_attrs.py #5608

Conversation

myavrum commented Jun 19, 2020 • edited

Description

Types of change

Checklist

explosion-bot commented Jun 19, 2020

svlandeg commented Jun 19, 2020

svlandeg left a comment • edited

Choose a reason for hiding this comment

svlandeg commented Jun 19, 2020

mahnerak commented Jun 19, 2020

myavrum commented Jun 19, 2020 •

edited

svlandeg left a comment •

edited