Research on the feasibility of using string hashing as the skeleton and unit ID for XLIFF files #20215
Replies: 3 comments
-
|
For a bit of context: In the GUI for example, we have "Off" translated differently depending on the context ( Though, in practice, I realize that documentation strings are much longer. I'm not sure if there are such translation conflicts in practice. So maybe it's just possible to remove IDs. Or better, remove them but implement a disambiguation possibility only when needed. |
Beta Was this translation helpful? Give feedback.
-
|
I think using string hashes as IDs can fully meet these requirements:
|
Beta Was this translation helpful? Give feedback.
-
|
I've converted this to a discussion because as-is this cannot be triaged. Feel free to perform this research in a separate repo / crowdin set up. Our current understanding is the approach might be worse. Without clear feasibility testing we can't triage this. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Related issues, PRs or discussions
None
What is the current state of the codebase?
Currently, the unit IDs of document translation strings are randomly generated.
Why are changes required?
This has the following disadvantages:
### Important notes### New Features### Changes### Bug Fixes### Changes for DevelopersPlease refer to [the developer guide](https://download.nvaccess.org/documentation/developerGuide.html#API) for information on NVDA's API deprecation and removal process.#### Deprecations| Name |Desktop key |Laptop key |Description|| Name |key |Description|| Option | Behaviour || Options | Disabled, Enabled || Default | Disabled || Default | Enabled |What technical changes are required?
When a new string appears, use a hash of the English string (such as MD5, SHA1, or SHA256) as the filler content for the skeleton part and as the unit ID of the translated string. To ensure sufficient translation accuracy, the hash calculation should include the prefix and suffix of the string in that line. For compatibility reasons, currently translated strings can remain as they are.
Are the proposed technical changes API breaking?
No
Are there potential risks or issues with the proposed implementation?
Uncertain
Beta Was this translation helpful? Give feedback.
All reactions