Python version: 3.4.3, latest spaCy (0.85), redownloaded all data (python -m spacy.en.download all).
It looks like there's something wrong with sentence splitting. Example:
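import spacy.en

pipeline = spacy.en.English()
tokens = pipeline("The Germans have their Faust; but Faust is a tragedy with a cosmic philosophic theme.")
# Indices of tokens the parser labels as a sentence root; a single sentence should yield exactly one.
root_inds = [ind for ind, token in enumerate(tokens) if token.dep_ == "ROOT"]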
root_inds has two elements, so the input is being split into two sentences: "The Germans have their Faust; but" and "Faust is a tragedy with a cosmic philosophic theme.".
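For anyone who wants to reason about the split without downloading the model data, here is a minimal sketch in plain Python. It does not call spaCy at all: the `deps` and `heads` lists are hand-written for a shortened version of the sentence to mimic the behaviour described above, and the helper names `root_indices` and `group_by_root` are hypothetical. It assumes each root token is its own head and that the head indices form a well-formed tree.

```python
from collections import defaultdict


def root_indices(dep_labels):
    """Return the positions whose dependency label is ROOT."""
    return [i for i, dep in enumerate(dep_labels) if dep == "ROOT"]


def group_by_root(heads):
    """Group token indices by the root they ultimately attach to.

    heads[i] is the index of token i's head; a root points at itself.
    Assumes the head indices contain no cycles other than root self-loops.
    """
    groups = defaultdict(list)
    for i in range(len(heads)):
        j = i
        while heads[j] != j:  # walk up until we reach a self-headed root
            j = heads[j]
        groups[j].append(i)
    return dict(groups)


# Hand-written illustration, NOT real parser output.
words = ["The", "Germans", "have", "their", "Faust", ";", "but",
         "Faust", "is", "a", "tragedy", "."]
deps = ["det", "nsubj", "ROOT", "poss", "dobj", "punct", "cc",
        "nsubj", "ROOT", "det", "attr", "punct"]
heads = [1, 2, 2, 4, 2, 2, 2, 8, 8, 10, 8, 8]

roots = root_indices(deps)
sentences = [" ".join(words[i] for i in idxs)
             for _, idxs in sorted(group_by_root(heads).items())]
print(roots)
print(sentences)
```

With this toy parse, two roots give two token groups, which is the same symptom as in the report; a correct parse of the sentence would have a single root and a single group.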
Your example now parses correctly, and aggregate accuracy has improved. Further improvements to sentence boundary detection should be forthcoming.
Please keep reporting prominent failures as they occur.
I guess this is related to the ROOT bug #57.