New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inadequate results in words separated by hyphen/minus #20
Comments
It is caused by the "Ignore modifiers of latin letters" option (activated by default). Actually, expanded versions of https://github.com/piroor/xulmigemo/blob/master/dics/latin-letters-with-marks.txt#L23
You'll see same result by the code
You'll see same result on the web console. After you turn off the checkbox and restart, an input
This doesn't match to text like |
To be honest, I'm not familiar to actual usecases of such special characters. If |
"Ignore modifiers of latin letters", is THE feature of XULmigemo for me as brazilian portuguese native speaker (regex is useful too, but less). not have it is a serious flaw of Firefox, so thank you for this extension. I know Chrome does this by default. I don't have it installed, but surely it woudn't find "t-w" when searching for "tw", which I believe is appropriate behavior. Fastest Search also works properly ("ignore diacritics" option), but it's intrusive and doesn't work on Places and urlbar. Maybe "t-w" should find "Ŧw" (I also unaware Ŧ), but "tw" shoudn't find "t-w". Please also note it's highligting more than expected. "t-w" highlights "t-wei", "x-wi" highlights "x-width" and so on. Another example: https://en.wikipedia.org/wiki/Cruzeiro_Esporte_Clube |
…gnore modifiers of latin letters" option #20
OK, I've removed such patterns from the generated regular expressions by e216063. Thank you for the advice!
It is caused by dictionary-assisted search feature. As I commented at #20 (comment) XUL/Migemo lists terms extracted from the dictionary so you'll see such expanded matching results. Initially XUL/Migemo was developed to assist incremental search for Japanese people and such dictionary-assisted search is very required for us. In Japanese text, same term can appear in different forms. For example, "Japan" can be "nihon", "にほん", "ニホン", or "日本". Moreover, in Japanese, the input "nihon" ("ni-hon") can mean "double" so we possibly want to find more terms like "2本", "二本" from same input. Then XUL/Migemo generates a large regexp ike "nihon|にほん|ニホン|日本|2本|二本" and find it from the webpage. Thus we don't need to input exact term to search - we can search various terms via simple ASCII input. Sadly, English dictionary is not designed to improve your search experience like for Japanese text. (I just added the en-US mode to pass the editors' review on AMO.) If you have any idea to improve your search experience with well-designed dictionary, please send pull request to modify it https://github.com/piroor/xulmigemo/tree/master/dics/en-US freely :) |
Page: https://addons.mozilla.org/en-US/firefox/files/browse/440201/file/files/foreground.js
The text was updated successfully, but these errors were encountered: