You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The current implementation of the AbbreviationDetector() does not handle abbreviations that contain a short form followed by a space followed by a number
For example, in this scenario:
The Proceeds of Crime Act 2002 ("PoCA 2000")
The abbreviation is not matched.
The original implementation in scispaCy does not appear to have been built to handle instances in which the short form is bounded by quote marks).
The text was updated successfully, but these errors were encountered:
ICLRandD
changed the title
Abbreviation detection not working where short form contains a space followed by digits
⚫ Abbreviation detection not working where short form contains a space followed by digits
Aug 7, 2019
ICLRandD
changed the title
⚫ Abbreviation detection not working where short form contains a space followed by digits
Abbreviation detection not working where short form contains a space followed by digits
Aug 7, 2019
In [1]: from abbreviations import schwartz_hearst
In [2]: schwartz_hearst.extract_abbreviation_definition_pairs(doc_text='The Proceeds of Crime Act 2002 ("PoCA 2002")')
Out[2]: {'PoCA 2002': 'Proceeds of Crime Act 2002'}
The current implementation of the
AbbreviationDetector()
does not handle abbreviations that contain a short form followed by a space followed by a numberFor example, in this scenario:
The abbreviation is not matched.
The original implementation in scispaCy does not appear to have been built to handle instances in which the short form is bounded by quote marks).
The text was updated successfully, but these errors were encountered: