You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In many cases, when the corpus contains misspelled or foreign words and phrases, top MWEs end up being those very rare misspelled expressions. This is a known problem when measuring PMI.
To Reproduce
Steps to reproduce the behavior:
Simply run MWE extraction and check the results.
Expected behavior
Top MWE results should be common expressions consisting of correct words.
Examples
Light Verb Constructions: LOCK THE DOOOOR
Possible Solutions
The proposed solution is to check the components of MWEs against a lexicon of the selected language to ensure they are actual words and not made-up words.
The text was updated successfully, but these errors were encountered:
meghdadFar
changed the title
Bug Report: Sometimes, MWEs are pretty rare and uncommon or misspelled phrases
Bug Report: Sometimes, MWEs are uncommon, wrong or misspelled phrases
Apr 8, 2024
meghdadFar
changed the title
Bug Report: Sometimes, MWEs are uncommon, wrong or misspelled phrases
Bug Report: Sometimes, MWEs are wrong or misspelled phrases
Apr 8, 2024
Description
In many cases, when the corpus contains misspelled or foreign words and phrases, top MWEs end up being those very rare misspelled expressions. This is a known problem when measuring PMI.
To Reproduce
Steps to reproduce the behavior:
Simply run MWE extraction and check the results.
Expected behavior
Top MWE results should be common expressions consisting of correct words.
Examples
Light Verb Constructions: LOCK THE DOOOOR
Possible Solutions
The proposed solution is to check the components of MWEs against a lexicon of the selected language to ensure they are actual words and not made-up words.
The text was updated successfully, but these errors were encountered: