Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing information for part of speech - French #1958

Closed
genbien opened this issue Feb 8, 2018 · 2 comments
Closed

Missing information for part of speech - French #1958

genbien opened this issue Feb 8, 2018 · 2 comments
Labels
lang / fr French language data and models models Issues related to the statistical models

Comments

@genbien
Copy link

genbien commented Feb 8, 2018

Hello,

I'm using spaCy's part of speech tagger but sometimes there's no output when using .pos_. The following script can reproduce the bug and shows a comparison between the section of missing information in .pos_ with what's found in .tag_:

nlp = spacy.load('fr_core_news_sm')
doc = nlp("Pour poser des congés, qu'est-ce que je fais ?")

tagged_pos = [[token.text, token.pos_] for token in doc]
tagged_tag = [[token.text, token.tag_] for token in doc]

pprint(tagged_pos)
pprint(tagged_tag)

Relevant output with missing part of speech information when using .pos_:

 ["qu'", ''],
 ['est', ''],

But when using .tag_, part of speech is found:

 ["qu'", 'ADV'],
 ['est', 'VERB'],

Is this just a situation where the relevant keys are missing from the tag_map (a very quick fix), or is there potentially something else going on here?

(Also, I'm not sure qu' is an adverb but maybe that's a separate issue...)

Your Environment

  • Operating System: MacOS 10.13.2
  • Python Version Used: 3.6.3
  • spaCy Version Used: 2.0.7
  • spaCy Model Used: fr_core_news_sm
  • Environment Information:
@ines ines added models Issues related to the statistical models lang / fr French language data and models labels Feb 9, 2018
@ines
Copy link
Member

ines commented Dec 14, 2018

The empty POS tags should be fixed in the upcoming models for v2.1.0!

I'm also merging this with #3052. We've now added a master thread for incorrect predictions and related reports – see the issue for more details.

@ines ines closed this as completed Dec 14, 2018
@lock
Copy link

lock bot commented Jan 13, 2019

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@lock lock bot locked as resolved and limited conversation to collaborators Jan 13, 2019
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
lang / fr French language data and models models Issues related to the statistical models
Projects
None yet
Development

No branches or pull requests

2 participants