Glossary: Match single word entries of parts of speech that have no suffix rules. #1791

pedro-mendonca · 2024-02-16T12:24:55Z

Problem

For any type of glossary item (part of speech), an entry with multiple words skips suffix handling and is immediately added to the glossary list.

As explained in Meta ticket https://meta.trac.wordpress.org/ticket/7473, a single-word glossary entry is currently only added if there are suffix rules for its part of speech, which is wrong. Even if there are no rules, at least the exact word should be added.

Solution

For single-word glossary terms of a part_of_speech that has no suffix rules, add the exact match to the glossary list.
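To illustrate the change, here is a minimal Python sketch of the behaviour before and after the fix. It is not the GlotPress code: the function names, the `SUFFIX_RULES` table, and its simplified shape (part of speech mapped to a list of suffixes to append) are all assumptions made for illustration only.

```python
# Hypothetical, simplified suffix rules: part of speech -> suffixes to append.
# The real rules in GlotPress have a different, richer shape.
SUFFIX_RULES = {
    "noun": ["s", "es"],
    "verb": ["s", "ed", "ing"],
    "abbreviation": [],  # no suffix rules, as with the "FAQ" entry
}


def match_terms_before_fix(term, part_of_speech, rules=SUFFIX_RULES):
    """Sketch of the old behaviour described in the Problem section."""
    if " " in term:
        # Multi-word entries are added as-is, with no suffix handling.
        return [term]
    suffixes = rules.get(part_of_speech, [])
    if not suffixes:
        # Bug: a single word without suffix rules is never added.
        return []
    return [term] + [term + suffix for suffix in suffixes]


def match_terms_after_fix(term, part_of_speech, rules=SUFFIX_RULES):
    """Sketch of the new behaviour described in the Solution section."""
    if " " in term:
        return [term]
    suffixes = rules.get(part_of_speech, [])
    # The exact word is always added, even when there are no suffix rules.
    return [term] + [term + suffix for suffix in suffixes]
```

With these assumed rules, `match_terms_before_fix("FAQ", "abbreviation")` returns an empty list, while `match_terms_after_fix("FAQ", "abbreviation")` returns a list containing only the exact word.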

Fixes #1790

Thanks @marcarmengou for reporting.

pedro-mendonca · 2024-02-16T12:58:40Z

There is one case on .org where I can't find the reason why it isn't behaving the same:
The entry "FAQ" (abbreviation) is matched there, even though all my local tests behave differently.
On my install, the entry "FAQ" is only matched after this PR.

I would expect this if any suffix rules had been added under the abbreviation array key in gp_glossary_match_suffixes.
But I couldn't find any customization of the gp_glossary_match_suffixes filter on w.org.

A translation where it is matching
https://translate.wordpress.org/projects/wp/dev/admin/gl/default/?filters%5Bterm%5D=FAQ&filters%5Bcase_sensitive%5D=yes&filters%5Bterm_scope%5D=scope_originals&filters%5Bstatus%5D=current

The glossary where "FAQ" is added.
https://translate.wordpress.org/locale/pt/default/glossary/

marcarmengou · 2024-02-16T13:27:53Z

In Romanian and Swedish, the entries are marked as "expression" and are also currently matched.

Romanian: https://translate.wordpress.org/projects/wp/dev/admin/ro/default/?filters%5Bterm%5D=FAQ&filters%5Bcase_sensitive%5D=yes&filters%5Bterm_scope%5D=scope_originals&filters%5Bstatus%5D=current

Swedish: https://translate.wordpress.org/projects/wp/dev/admin/sv/default/?filters%5Bterm%5D=FAQ&filters%5Bcase_sensitive%5D=yes&filters%5Bterm_scope%5D=scope_originals&filters%5Bstatus%5D=current

pedro-mendonca · 2024-02-16T13:34:41Z

From my testing, I found that for strings that are an exact match of the glossary entry, this isn't a problem: the match always happens.
The issue only affects strings that contain the glossary term among other words.

pedro-mendonca · 2024-02-16T14:20:38Z

Confirmed.
Strings that are multi-word sentences are split by the glossary term search built by gp_glossary_add_suffixes.
Then, these chunks are compared to the array of glossary keys. If a split chunk matches a key in the glossary terms array, there is a match.
For these cases, the current PR fixes the previously missing match.

For strings that are a single word, the splitting will always return the exact key of the glossary entry, so they never had the issue of matching through gp_glossary_add_suffixes.
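The split-and-compare behaviour described in this comment can be sketched roughly as follows. This is a Python illustration under stated assumptions, not the GlotPress implementation: `find_glossary_matches`, its parameters, and the word-boundary regex are hypothetical, and `match_terms` stands in for whatever search terms were generated for the glossary entries.

```python
import re


def find_glossary_matches(text, match_terms, glossary_keys):
    """Split `text` on the search terms and keep chunks that are glossary keys.

    `match_terms` is the (assumed) list of terms produced for the glossary
    entries; it is empty when a single-word entry had no suffix rules and
    the exact word was not added.
    """
    if match_terms:
        # Longest terms first, so longer alternatives win over their prefixes.
        ordered = sorted(match_terms, key=len, reverse=True)
        pattern = r"\b(" + "|".join(re.escape(t) for t in ordered) + r")\b"
        # The capturing group keeps the matched terms in the result.
        chunks = re.split(pattern, text)
    else:
        # Nothing to split on: the whole string is the only chunk.
        chunks = [text]
    return [chunk for chunk in chunks if chunk in glossary_keys]


glossary = {"FAQ"}

# Single-word string: the only chunk is the glossary key itself.
exact = find_glossary_matches("FAQ", [], glossary)

# Multi-word string without the exact term in the search list: no match.
missing = find_glossary_matches("Read the FAQ page", [], glossary)

# Multi-word string once the exact term is in the search list: match.
fixed = find_glossary_matches("Read the FAQ page", ["FAQ"], glossary)
```

In this sketch the single-word string matches regardless of the search terms, while the multi-word string only matches once the exact word is part of the search terms, which mirrors the difference reported above.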

pedro-mendonca · 2024-02-16T14:29:15Z

Added a test for all the same terms as single-term strings.
These tests are not needed for the current PR, because those cases were already working fine, but adding them here might help debug further issues.

Also, as there are many strings to test, I updated the test to use a data provider.

pedro-mendonca added 2 commits February 16, 2024 12:10

Add failing test for matching empty suffixes part_of_speech suffix arrays

c48b1f3

Add match for single word glossary entry of a part_of_speech that have no suffix rules.

7d939f0

pedro-mendonca requested a review from amieiro February 16, 2024 12:25

Test strings that are the exact match of glossary entries, of all parts_of_speech

e57be08

Merge branch 'develop' into issue-glossary-matching

b34c822

amieiro self-assigned this Feb 23, 2024

Merge branch 'develop' into issue-glossary-matching

05853ca

amieiro approved these changes Feb 23, 2024

amieiro merged commit e7bc754 into GlotPress:develop Feb 23, 2024
11 checks passed
