Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: search form, new checkbox "Accent/Diacritic Match" #11028

Open
conky77 opened this issue Jan 10, 2022 · 2 comments
Open

Feature request: search form, new checkbox "Accent/Diacritic Match" #11028

conky77 opened this issue Jan 10, 2022 · 2 comments

Comments

@conky77
Copy link
Contributor

conky77 commented Jan 10, 2022

Description of the Issue

I propose a new feature (see also this conversation ):
On the search form (for all tab sheets), create a new checkbox
"Accent/Diacritic Match" (see images below
The new flag should work in combination with:

  • match case (the current behaviour with Regular Expression ( pattern "[[=e=]]" doesn't Match case )
  • Match whole word only
  • Backward Direction
  • Search mode: Normal and extended
    The new feature should work with all variants of vowels and consonant:s, i.e.:
    e: e è é ë ê
    n: n Ñ
    c: Ç
    L: Ł ƚ

Actual Behavior

The only way to search with accent insensitive is actually to enable "regular expression and substitute every letter with the pattern "[[=e=]]" (i.e. for letter "e")

Debug Information

Notepad++ v8.2 (64-bit)
Build time : Dec 30 2021 - 03:40:42
Path : C:\Program Files\Notepad++\notepad++.exe
Command Line : -multiInst
Admin mode : ON
Local Conf mode : OFF
Cloud Config : OFF
OS Name : Windows 10 Home (64-bit)
OS Version : 2009
OS Build : 22000.376
Current ANSI codepage : 1252
Plugins : ComparePlugin.dll mimeTools.dll NppConverter.dll NppExport.dll NppXmlTreeviewPlugin.dll

feature request: form with new flag
image

current form:

image

@alankilborn
Copy link
Contributor

I don't thing the terminology "Match accent/diacritic" is good.
"Match case" means exactly match the specified text.
I would think "Match accent/diacritic" ticked would mean match exactly the accented characters specified, but I think the functionality you are after does not mean that.
Perhaps "Consider characters to possibly be accented" (I know, too long :P ) would be more indicative of the functionality.

Where is a complete list of the accented characters, or does your 4 lines cover it?
Or, is the software supposed to just take something like testing and convert it into [[==t==]][[==e==]][[==s==]][[==t==]][[==i==]][[==n==]][[==g==]] before executing the search?

@conky77
Copy link
Contributor Author

conky77 commented Jan 10, 2022

The label: Microsoft and SQL server use (for the collations) "Accent sensitive or accent insensitive" (diacritic is more generic than "accent"): Is it a solution?.
I think that creating (and maintaining) a list is too difficult. I think that converting "testing" into "[[==t==]][[==e==]][[==s==]][[==t==]][[==i==]][[==n==]][[==g==]]" is a good idea. Maybe the only problem of this way is that you can't distinct uppercase and lowercase: you should disable "match case". But...something is better than nothing: it could be a beginning.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants