You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A-dataArea: Data coverage or qualityA-designArea: Architecture or designC-dnamesComponent: Language/Region/... Display NamesS-mediumSize: Less than a week (larger bug fix or enhancement)T-coreType: Required functionality
The DisplayNames component comes with a large amount of data. It is the largest locale-specific data in ICU and will also likely be the largest in ICU4X.
There are a few things that make DisplayNames interesting:
The majority of the display names are probably not useful to carry for most clients. For example, users speaking Japanese are more likely to need the translation for the Katakana script than the translation for the Cherokee script. We should explore something like japanext and likelysubtagsext where we have a core set and an extended set.
Regional variants often override only a small number of strings. For example, en-GB and en-US might be equivalent for all region names except for one or two. This doesn't play nicely with the deduplication mechanism we've thusfar relied on.
I think 2 is a big issue, and I think it also happens for other data. We could, instead of loading a single data struct in the formatter constructor, load all structs for the whole fallback chain. This could use naive fallback (i.e. chopping off tags), so no additional data would be needed. We can then remove redundant entries from en-GB and en-001 if they are in en (if we're using naive we'd still have duplication across GB and 001 though).
Discussed on 2023-07-04. We will use the auxiliary key model, similar to currency formatter (#1441), which resolves the issues in the OP.
sffc
removed
discuss
Discuss at a future ICU4X-SC meeting
discuss-triaged
The stakeholders for this issue have been identified and it can be discussed out-of-band
labels
Jul 5, 2023
A-dataArea: Data coverage or qualityA-designArea: Architecture or designC-dnamesComponent: Language/Region/... Display NamesS-mediumSize: Less than a week (larger bug fix or enhancement)T-coreType: Required functionality
The DisplayNames component comes with a large amount of data. It is the largest locale-specific data in ICU and will also likely be the largest in ICU4X.
There are a few things that make DisplayNames interesting:
CC @snktd @robertbastian @markusicu
The text was updated successfully, but these errors were encountered: