Languages should be sorted alphabetically based on what users see #3295

mmahmoudian · 2021-10-12T12:53:41Z

Describe the bug
On the main website's dataset's page, the languages are sorted based on their ISO code, but the text is the actual language name in English. This makes it hard to find the language user is after. For instance I was looking for "Persian" among items starting with "P" and when I didn't find it I tried to find "Farsi" among "F"s and at the first look I failed to find it but I was sure that it exists because I myself contributed a lot, so I grep the page source code and found:

<option value="eu">Basque</option>
<option value="fa">Persian</option>
<option value="fi">Finnish</option>
<option value="fr">French</option>
<option value="fy-NL">Frisian</option>

This would affect a lot of languages including but not limited to Welsh (cy), Spanish (es),

So based on a quick analysis these are the distances that the current situation is compared to what users expect them to see:

As you can see the distance can be extremely far for some of the entries!

To Reproduce

Open https://commonvoice.mozilla.org/en/datasets

Expected behavior
Languages are sorted alphabetically and based on the visible text to user and not the language codes that is not visible to user.

Screenshots

Desktop or Mobile (please complete the following information):

OS: Manjaro
Browser: Firefox
Version 93.0 (64-bit)

Additional Hardware (were you using headphones, an external speaker or an external microphone?):
--irrelevant--

Additional context
--irrelevant--

The text was updated successfully, but these errors were encountered:

This commit adds localized sort logic for the language dropdown. It gets the language name for the locales and then uses localized comparison so that non-Latin and accented characters are sorted correctly.

This commit sets up a new hook that can be used to fetch a list of sorted locales. The list is sorted based on the name of each locale, localized for the current locale of the client.

…LanguageSelect

…localization. (#3301) * #3295: Add sort logic for locales. This commit adds localized sort logic for the language dropdown. It gets the language name for the locales and then uses localized comparison so that non-Latin and accented characters are sorted correctly. * #3295: Remove unnecessary empty line additions. * #3295: Add `useSortedLocales` hook; Specify `getString` type This commit sets up a new hook that can be used to fetch a list of sorted locales. The list is sorted based on the name of each locale, localized for the current locale of the client. * #3295: Leverage `useSortedLocales` hook for Datasets and LanguageSelect * #3295: Insert spaces to make import style consistent

ChristianMMacy added a commit to ChristianMMacy/common-voice that referenced this issue Oct 17, 2021

common-voice#3295: Remove unnecessary empty line additions.

69c3cb0

ChristianMMacy mentioned this issue Oct 17, 2021

Sort languages in the Data Sets > Languages dropdown, accounting for localization. #3301

Merged

ChristianMMacy added a commit to ChristianMMacy/common-voice that referenced this issue Oct 19, 2021

common-voice#3295: Leverage useSortedLocales hook for Datasets and …

89eec03

…LanguageSelect

ChristianMMacy added a commit to ChristianMMacy/common-voice that referenced this issue Oct 20, 2021

common-voice#3295: Insert spaces to make import style consistent

893ac3a

phirework closed this as completed in #3301 Oct 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Languages should be sorted alphabetically based on what users see #3295

Languages should be sorted alphabetically based on what users see #3295

mmahmoudian commented Oct 12, 2021 •

edited

Languages should be sorted alphabetically based on what users see #3295

Languages should be sorted alphabetically based on what users see #3295

Comments

mmahmoudian commented Oct 12, 2021 • edited

mmahmoudian commented Oct 12, 2021 •

edited