Use locale grouping to format numbers #1781

nkottary · 2020-04-06T05:53:27Z

Currently numbers are grouped in groups of 3 regardless of locale. This PR implements grouping by looking at locale. I took some screenshots by using wxIntegerValidator on wxTextCtrl:

Default grouping or grouping with "en_US", "en_UK" etc

Grouping with "en_IN"

vadz

Thanks for the changes! We definitely should support locale-specific grouping, however this PR can't be merged as is because:

It breaks compatibility by modifying an existing function: this should be simple to fix by just adding a new function and keeping the old one as a wrapper for it.
It changes Mac code almost certainly incorrectly, which probably explains the crashes in the Travis CI builds under this platform. This should be easy to fix too, just by creating the string properly.
It results in test failures under MSW in AppVeyor builds. I'm not sure why this is so, but it definitely needs to be fixed.

Speaking of tests, it would be great to have tests checking that formatting/grouping in Indian locale works as expected (the test could be skipped if Indian locale is not available). Ideally, wxNumberFormatter::AddThousandsSeparators() would be factored out into a testable public function, with tests for it added to the test suite.

Could you please try to fix these problems? If you have any questions, please post to wx-dev to discuss them.

TIA!
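
The compatibility point above (add a new function, keep the old one as a wrapper) could look roughly like the sketch below. It is only an illustration: the class name and the name of the new function are hypothetical, the locale query is replaced by hard-coded values, and the signature of GetThousandsSeparatorIfUsed() is assumed rather than copied from the wxWidgets headers.

```cpp
#include <string>

// Hypothetical stand-in for a number formatter class. The class name and the
// new function's name are assumptions made for this sketch.
class NumberFormatterSketch
{
public:
    // New function: reports the separator and, optionally, the grouping.
    static bool GetThousandsSeparatorAndGrouping(char* sep,
                                                 std::string* grouping)
    {
        // Hard-coded values stand in for a real locale query.
        if ( sep )
            *sep = ',';
        if ( grouping )
            *grouping = "3;0";
        return true;
    }

    // Existing function: signature left untouched so that current callers
    // keep compiling; it now simply forwards to the new function.
    static bool GetThousandsSeparatorIfUsed(char* sep)
    {
        return GetThousandsSeparatorAndGrouping(sep, nullptr);
    }
};
```

Existing callers keep using the old entry point unchanged, while new code can ask for the grouping as well.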

include/wx/numformatter.h

interface/wx/intl.h

src/common/intl.cpp

nkottary · 2020-04-06T16:35:34Z

Thank you for the detailed comments @vadz !

Co-Authored-By: VZ <vz-github@zeitlins.org>

vadz

Thanks for the fixes, but there seems to be a serious problem here: the code just returns whatever localeconv() or GetLocaleInfo(SGROUPING) returns directly, but they use different formats. Notably, MSW doesn't use CHAR_MAX as a delimiter (see the documentation), which probably explains the CI failures. And, FWIW, macOS uses a different, but much simpler, convention, with just (primary) groupingSize and secondaryGroupingSize.

Anyhow, we need to standardize on one convention, maybe MSW one as it seems simpler, document it and return the grouping information in the same format under all platforms to make this work.
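
To illustrate what standardizing on one convention could mean, here is a minimal sketch that converts a POSIX-style grouping array (the format of lconv::grouping, where a terminating NUL means "repeat the last size" and CHAR_MAX means "no further grouping") into a semicolon-separated string in the style of "3;2;0". The helper name is hypothetical and this is not the code from the PR.

```cpp
#include <climits>
#include <string>

// Hypothetical helper, not part of wxWidgets: convert a POSIX-style grouping
// array into a semicolon-separated string such as "3;0" or "3;2;0".
std::string PosixGroupingToString(const char* grouping)
{
    std::string result;
    if ( !grouping )
        return result;

    const char* p = grouping;
    for ( ; *p != '\0' && *p != CHAR_MAX; ++p )
    {
        if ( !result.empty() )
            result += ';';
        result += std::to_string(static_cast<int>(*p));
    }

    // A terminating NUL means "repeat the last group size", expressed here by
    // a trailing 0. CHAR_MAX means "no further grouping", so nothing is
    // appended in that case. An empty array (no grouping) stays empty.
    if ( *p == '\0' && !result.empty() )
        result += ";0";

    return result;
}
```

With this mapping, a POSIX grouping of bytes 3, 2 followed by NUL becomes "3;2;0", and the rest of the code only has to understand one format.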

interface/wx/numformatter.h

nkottary · 2020-04-11T11:48:48Z

Thanks! I will look into the MacOS thing once I get Windows tests to pass.

vadz · 2020-04-19T12:13:45Z

Thanks for the updates! Other than the test failures, which should be fixed, of course, I wonder why the newly added tests don't just call FormatNumber() directly, with the different values of grouping? This was one of the ideas behind making this function public in the first place...

Also, it's a bit too late for this advice as you've already written the code using this style (and you do not need to change it), but if you write more tests, please feel free to use the CATCH framework directly. This is much simpler than using the legacy CppUnit macros, as you can just write functions (instead of classes with methods, special set up and tear down methods and what not) and use a single CHECK() macro instead of many different ones. See e.g. tests/rowheightcache/rowheightcachetest.cpp for a simple example of a test written in this way or tests/validators/valtext.cpp for a closer (but slightly more complicated, as it uses sections) example. Also note that you can combine both kinds of tests in the same file, i.e. you can just add TEST_CASE()s to it as in e.g. tests/filename/filenametest.cpp.

nkottary · 2020-04-22T05:39:14Z

Looking at the test failures it seems that the grouping string returned from GetLocaleInfo on Windows does not end with '0' when it should. On my laptop I have a Visual Studio 2019 build of this PR with nmake and test.exe runs fine on that. So I'm not sure why it is different on appveyor.

vadz · 2020-04-22T20:58:24Z

Strange, the behaviour of a system function such as ::GetLocaleInfo() shouldn't depend on the compiler or the build method. It might depend on the OS version, but MSDN doesn't say anything about it changing. It could depend on locale, of course. Perhaps you could add some debugging statements to check locale and the value of SGROUPING on AppVeyor?

nkottary · 2020-04-26T10:00:04Z

Indeed the grouping string is different on appveyor for en_IN locale. It is 3;2. Whereas on my Linux and Windows 10 systems it is 3;2;0.

vadz · 2020-04-26T14:24:56Z

This is indeed annoying, apparently locale info depends on the MSW version (AppVeyor is probably using something different from the normal desktop one). Ideally the tests would work in both cases, but you could also just skip them if the grouping is not the expected one.

Perhaps you could combine it with my suggestion to test FormatNumber() directly to retarget the existing tests to it?

nkottary · 2020-04-27T04:47:53Z

Okay, I will add tests for FormatNumber()

nkottary · 2020-04-27T12:08:16Z

Regarding the CI failure, I have seen the socketStream test fail before. It passes if I re-run the test by closing and re-opening the PR. In any case, it seems to be unrelated to this PR.

nkottary · 2020-11-05T06:50:03Z

Hi, can this be merged?

vadz

Thanks, I've reviewed this again and it should indeed be merged, even though I'd like to make a few more changes first, so it's going to take some time.

The remarks below are mostly reminders for myself, but if you have any comments about them, please don't hesitate to leave your comments.

interface/wx/intl.h

interface/wx/numformatter.h

vadz · 2020-11-05T22:21:30Z

include/wx/numformatter.h

+
+    // Format a number s with the specified thousands separator, decimal separator
+    // and grouping format for the thousands separator
+    static void FormatNumber(wxString &s, wxChar thousandsSep,


Sorry, I don't remember any more, but was there a reason to pass s as output parameter here instead of just returning it? Is it done for efficiency or is there something else?

I just followed the declaration of AddThousandsSeparators(wxString& s).
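
For reference, applying a grouping string to a number can be sketched as follows. This version returns the result instead of modifying its argument, which is the alternative raised in the question above. The function name is hypothetical and the code is not the PR's FormatNumber(); it also assumes that a grouping without a trailing 0 (such as "3;2") means that grouping stops after the listed sizes.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: insert `sep` into the integer part of `number`
// according to a semicolon-separated grouping such as "3;0" or "3;2;0".
std::string GroupDigits(const std::string& number, char sep, char decimalSep,
                        const std::string& grouping)
{
    // Parse the group sizes, e.g. "3;2;0" -> {3, 2, 0}.
    std::vector<std::size_t> sizes;
    std::size_t value = 0;
    bool haveDigit = false;
    for ( char c : grouping )
    {
        if ( c >= '0' && c <= '9' )
        {
            value = value * 10 + static_cast<std::size_t>(c - '0');
            haveDigit = true;
        }
        else if ( c == ';' )
        {
            if ( haveDigit )
                sizes.push_back(value);
            value = 0;
            haveDigit = false;
        }
    }
    if ( haveDigit )
        sizes.push_back(value);

    // A trailing 0 means that the preceding size repeats indefinitely.
    bool repeatLast = false;
    if ( !sizes.empty() && sizes.back() == 0 )
    {
        repeatLast = true;
        sizes.pop_back();
    }
    if ( sizes.empty() )
        return number;

    // Only the digits between an optional sign and the decimal separator
    // are grouped.
    const std::size_t start =
        !number.empty() && (number[0] == '-' || number[0] == '+') ? 1 : 0;
    std::size_t pos = number.find(decimalSep);
    if ( pos == std::string::npos )
        pos = number.size();

    std::string result = number;
    std::size_t i = 0;
    for ( ;; )
    {
        const std::size_t size = sizes[i];
        if ( size == 0 || pos - start <= size )
            break;                      // not enough digits left for a group

        pos -= size;
        result.insert(pos, 1, sep);     // positions before pos are unaffected

        if ( i + 1 < sizes.size() )
            ++i;
        else if ( !repeatLast )
            break;                      // sizes exhausted and no repetition
    }

    return result;
}
```

For example, GroupDigits("123456789", ',', '.', "3;2;0") yields "12,34,56,789", while the same call with "3;0" yields "123,456,789".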

interface/wx/numformatter.h

vadz · 2020-11-05T22:23:31Z

misc/languages/langtabl.txt

@@ -69,6 +69,7 @@ wxLANGUAGE_ENGLISH_PHILIPPINES         en_PH  LANG_ENGLISH     SUBLANG_ENGLISH_P
 wxLANGUAGE_ENGLISH_SOUTH_AFRICA        en_ZA  LANG_ENGLISH     SUBLANG_ENGLISH_SOUTH_AFRICA        LTR    "English (South Africa)"
 wxLANGUAGE_ENGLISH_TRINIDAD            en_TT  LANG_ENGLISH     SUBLANG_ENGLISH_TRINIDAD            LTR    "English (Trinidad)"
 wxLANGUAGE_ENGLISH_ZIMBABWE            en_ZW  LANG_ENGLISH     SUBLANG_ENGLISH_ZIMBABWE            LTR    "English (Zimbabwe)"
+wxLANGUAGE_ENGLISH_INDIA               en_IN  LANG_ENGLISH     SUBLANG_ENGLISH_INDIA               LTR    "English (India)"


I believe this should be moved after wxLANGUAGE_ENGLISH_EIRE to keep things in alphabetical order.

Fixed in a2ef888

src/common/intl.cpp

include/wx/intl.h

src/common/intl.cpp

vadz · 2020-11-05T23:22:31Z

src/common/numformatter.cpp

+            s_thousandsSeparator = s[0];
+            const wxString
+                g = wxLocale::GetInfo(wxLOCALE_GROUPING, wxLOCALE_CAT_NUMBER);
+            if ( g[0] != '\0')


I don't understand this test... First of all, I think we should always overwrite s_grouping, as it can have the value corresponding to the old locale. Second, AFAICS g will never have a NUL byte inside it.

Yes, I think it is unnecessary. I will remove it.

Fixed in 9ed8ad9

vadz · 2020-11-05T23:25:56Z

src/common/numformatter.cpp

+    {
+        const wxString
+            s = wxLocale::GetInfo(wxLOCALE_THOUSANDS_SEP, wxLOCALE_CAT_NUMBER);
+        if ( s.length() == 1 )


Note to self: need to check whether this works correctly with non-ASCII separators, as in the fr_FR.utf8 locale, where the thousands separator is 'NARROW NO-BREAK SPACE' (U+202F).
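
The concern can be shown with plain std::string, independently of wxString: U+202F takes three bytes in UTF-8, so a length check that counts bytes would reject it even though it is a single character. The helper below is hypothetical and assumes valid UTF-8 input; whether the wxString check above is affected depends on how wxString counts its length, which this sketch does not address.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: returns true if `s` (assumed to be valid UTF-8) holds
// exactly one code point, by counting the bytes that are not continuation
// bytes (continuation bytes have the form 10xxxxxx).
bool IsSingleCodePointUtf8(const std::string& s)
{
    std::size_t count = 0;
    for ( unsigned char c : s )
    {
        if ( (c & 0xC0) != 0x80 )
            ++count;
    }
    return count == 1;
}
```

Here std::string("\xE2\x80\xAF"), the UTF-8 encoding of U+202F, has size() equal to 3 but still passes IsSingleCodePointUtf8().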

src/common/intl.cpp

nkottary added 3 commits April 5, 2020 14:02

WIP: Thousands separators with locale grouping

5c07878

Fixes

1256958

Fix default grouping for mac

1c574cd

vadz requested changes Apr 6, 2020

include/wx/numformatter.h

interface/wx/intl.h

src/common/intl.cpp (outdated)

src/common/intl.cpp (outdated)

vadz added the "work needed" label (too useful to close, but can't be applied in current state) Apr 6, 2020

nkottary and others added 4 commits April 7, 2020 16:46

restore GetThousandsSeparatorIfUsed method

a95ac3b

Update interface/wx/intl.h

c38d186

Co-Authored-By: VZ <vz-github@zeitlins.org>

Restore blank line

15a5d98

Fix mac string

7c910a5

nkottary force-pushed the nk/grouping branch from f1b23bf to 7c910a5 Compare April 7, 2020 13:52

Blank line fix

62d3ef3

vadz reviewed Apr 8, 2020

interface/wx/numformatter.h

Add version in doc, format grouping string to windows format

cf4d612

nkottary force-pushed the nk/grouping branch from 709fc07 to cf4d612 Compare April 11, 2020 11:47

Attempt grouping on Mac OS

5b46fd3

nkottary closed this Apr 11, 2020

nkottary reopened this Apr 11, 2020

nkottary added 2 commits April 12, 2020 09:30

Fix build issues

d83f3ca

Bracket around case

90744e3

nkottary closed this Apr 12, 2020

nkottary reopened this Apr 12, 2020

Refactor, add test for Indian locale

8414b63

nkottary added 2 commits April 26, 2020 13:26

Add debug printing of grouping and locale

61fcec2

More debug info

1f6b5b8

Remove debug prints, dont test indian locale if grouping is incorrect

9bb2520

Add tests for FormatNumber

67b945e

nkottary closed this Nov 5, 2020

nkottary reopened this Nov 5, 2020

vadz reviewed Nov 5, 2020

src/common/intl.cpp (outdated)

vadz and others added 3 commits November 6, 2020 00:31

Apply suggestions from code review

99549bc

Maintain alphabetical order in langtbl.txt

a2ef888

Remove null check

9ed8ad9

vadz mentioned this pull request Aug 15, 2021

Add wxUILocale class usable under macOS #2464

Merged
