Addition of mp_radix_size_overestimate (behaviour like the function in GMP) #343

czurnieden · 2019-09-06T16:11:21Z

Also changed mp_radix_size to use mp_ilogb while deprecating it at the same time and rewrote mp_fwrite to use mp_radix_sizeinbase instead.

See comment in bn_mp_fwrite.c for some of the consequences.

czurnieden · 2019-09-06T16:22:08Z

Oh, using deprecated functions (added test for mp_radix_size in test.c) is a full error, ok.
But only with GCC not clang?

I don't understand the Windows error but I assume it is for the same reason?

czurnieden · 2019-09-06T17:02:33Z

May I propose (again) to kill MP_8BIT?
'Cause it's killing me! ;-)

sjaeckel · 2019-09-07T08:40:29Z

May I propose (again) to kill MP_8BIT?
'Cause it's killing me! ;-)

let it rot, disable what doesn't work and add it to the deprecation list, we'll remove it after the next release

czurnieden · 2019-09-07T15:01:13Z

let it rot, disable what doesn't work and add it to the deprecation list, we'll remove it after the next release

Among the handful of 8-bit MCUs that have enough Flash and RAM to run a stripped-down LTM, and enough circulation to make it worth the hassle, are the bigger versions of ATxmega (and, to some extent, the biggest versions of ATmega). I will see if I can get my hands on one in the next couple of days. No virtual MCUs here; I need the power consumption with the internal clock, too (the datasheet gives only the values with an external clock?).
Is there some kind of main forum where most of the fanbois sit?

And how much work would it be?

czurnieden ~/GITHUB/libtommath O (radix_sizeinbase)$ find ./ -type f  -name '*c' -exec grep  MP_8BIT {} \; -print
#ifdef MP_8BIT
       * MP_8BIT (It is unknown if the Lucas-Selfridge test works with 16-bit
#if defined (MP_8BIT) || defined (LTM_USE_FROBENIUS_TEST)
#ifdef MP_8BIT
#ifdef MP_8BIT
./bn_mp_prime_is_prime.c
#ifndef MP_8BIT
#ifdef MP_8BIT
#ifdef MP_8BIT
./demo/test.c
#ifdef MP_8BIT
./demo/main.c
#ifndef MP_8BIT
./bn_mp_read_unsigned_bin.c
#ifndef MP_8BIT
./bn_mp_to_unsigned_bin.c
#ifndef MP_8BIT
   /* CZ TODO: Some of them need the full 32 bit, hence the (temporary) exclusion of MP_8BIT */
./bn_mp_prime_strong_lucas_selfridge.c
#ifndef MP_8BIT
./bn_prime_tab.c
      Since mp_radix_sizeinbase() can overshoot by one (two with MP_8BIT)
./bn_mp_fwrite.c
#if !defined(MP_8BIT)
#if defined(MP_64BIT) || !(defined(MP_8BIT) || defined(MP_16BIT))
./bn_mp_montgomery_setup.c
#ifdef MP_8BIT
#if ( (defined MP_8BIT) || (defined MP_16BIT) )
#if ( (defined MP_8BIT) || (defined MP_16BIT) )
#if ( (defined MP_8BIT) || (defined MP_16BIT) )
   /* There is no mp_set_u16 for MP_8BIT */
#if ( (defined MP_8BIT) && (INT_MAX > 0xFFFF))
#if ( (defined MP_8BIT) || (defined MP_16BIT) )
./bn_mp_radix_sizeinbase.c
#ifdef MP_8BIT
./mtest/mtest.c
#ifdef MP_8BIT
      /* (32764^2 - 4) < 2^31, no bigint for >MP_8BIT needed) */
./bn_mp_prime_frobenius_underwood.

Oh, that's not much, thought it to be more!
Plus wiki and "issue"-entry, bn.tex, ChangeLog, and the little hints you mentioned here and there (deprecation).
Nothing I would call exhausting.
OK. let's make it official (in another PR).

Oh, am away tonight, now, actually, (brother-in-law has his 50th and who says no to a free dinner? ;-) ), so till tomorrow.

sjaeckel

Looks good!

How about calling this mp_radix_size_approx()? ... or I was already wondering whether it would make sense to have a library setting to select between the two modes ... but that's probably too much and would bring more problems than advantages...

bn_mp_fwrite.c

sjaeckel · 2019-09-07T15:24:33Z

OK. let's make it official (in another PR).

👍

Oh, am away tonight, now, actually, (brother-in-law has his 50th and who says no to a free dinner? ;-) ), so till tomorrow.

Enjoy & have fun!

sjaeckel · 2019-09-08T19:46:44Z

How about calling this mp_radix_size_approx()?

or mp_radix_size_fast()?

minad · 2019-09-10T11:51:38Z

mp_radix_size_overestimate?

czurnieden · 2019-09-11T19:07:40Z

mp_radix_size_overestimate?

*grmbl*

This commit is just the renaming of mp_radix_sizeinbase to mp_radix_size_overestimate, all the rest will be handled in #348 if necessary.

(helper.pl --update-files does not seem to format tommath.def and tommath_class.h correctly. Is that a known problem?)

minad · 2019-10-01T16:17:49Z

@czurnieden the lookup tables look rather large. Could we use smaller tables and a more crude estimate? I think that might be preferable for embedded systems. We will pay more for the allocations then but the constant costs of the tables would be smaller.

However allocators are usually rounding themselves so a more crude estimate might not be too bad.

czurnieden · 2019-10-01T17:28:55Z

the lookup tables look rather large

They are 520 bytes(!) large (260 for 8-bit as long as we have MP_8BIT).
Half a kilobyte. We will drop 8-bit support in the near future, so the most memory-restricted MCUs will not get supported anymore at that time.

But I know where you are coming from and as I am there myself, too, from time to time, I understand.
Mmh…
I can offer a stripped-down version with the same accuracy that is much smaller but only supports all powers of two (as they are calculated directly) and a hardcoded base 10. This would get rid of the table(s) and all the logic to handle the tables and will save about 1k (probably less).

However allocators are usually rounding themselves so a more crude estimate might not be too bad.

You can split the tables in half (e.g.: using a 16-bit type instead of a 32-bit one) but that would lose way too much of the accuracy. Yes, I played with it while designing this function but had a hard time guaranteeing the -0 in the -0/+x tolerance with such insufficient accuracy without an "extra" on top. How large does that "extra" need to be? Hard to tell without a proper error analysis.
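To illustrate the accuracy trade-off, here is a minimal sketch for base 10 only. It is not the table layout from this PR: the function names and the choice to round the fixed-point constant upwards are assumptions made for the example. Rounding up is one way to keep the "-0" side of the tolerance, and the sketch shows how the "+x" side grows when the constant is stored in 16 bits instead of 32.

```c
#include <stdint.h>

/* Hypothetical illustration, not code from this PR.
 * Both functions return an upper bound on the number of decimal digits of a
 * value with `bits` significant bits (exact value: floor(bits*log10(2)) + 1
 * for the largest such number).
 * Assumes the products below fit into 64 bits. */

/* ceil(log10(2) * 2^16) = 19729, i.e. a 16-bit table entry */
static uint64_t decimal_digits_q16(uint64_t bits)
{
   return ((bits * 19729u) >> 16) + 1u;
}

/* ceil(log10(2) * 2^32) = 1292913987, i.e. a 32-bit table entry */
static uint64_t decimal_digits_q32(uint64_t bits)
{
   return ((bits * 1292913987u) >> 32) + 1u;
}
```

For 64 bits both variants report 20 digits, which is exact. For a 1,000,000-bit input the 32-bit constant still reports the exact 301030 digits, while the 16-bit constant reports 301041, so the overshoot of the narrower entry grows with the size of the number.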

The alternatives, e.g: a fixed-point log, would add a lot of logic which I doubt needs less memory than the tables.

So the only thing I can come up with is the functionally stripped down version described above.

Which leads me to the question: do we really need all the bases? We got a quick base-10 (didn't take a closer look, but looks quite hardcoded, don't know how much work it would be to extend it) and we can rewrite the old loop to get the power-of-two bases directly.
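A rough sketch of what such a table-free variant could look like, assuming the caller already knows the number of significant bits. The helper name, its signature, and the constant 30103/100000 are made up for this example and are not taken from the PR; the result counts digits only, without sign or terminating NUL.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not the code from this PR.
 * Returns an upper bound on the number of digits needed to print a value
 * with `bits` significant bits in `radix`, or 0 if the radix is neither 10
 * nor a power of two in 2..64. */
static size_t radix_digits_reduced(uint64_t bits, unsigned radix)
{
   unsigned k = 0u;

   if ((radix != 10u) &&
       ((radix < 2u) || (radix > 64u) || ((radix & (radix - 1u)) != 0u))) {
      return 0u;               /* unsupported in the reduced variant */
   }
   if (bits == 0u) {
      return 1u;               /* zero still prints as one digit */
   }
   if (radix == 10u) {
      /* 30103/100000 is slightly larger than log10(2) = 0.30102999...,
       * so the result can overshoot but never undershoot.
       * Assumes bits * 30103 fits into 64 bits. */
      return (size_t)((bits * 30103u) / 100000u) + 1u;
   }
   while ((1u << (k + 1u)) <= radix) {
      k++;                     /* k = log2(radix) */
   }
   /* each digit holds k bits: ceil(bits / k), exact for powers of two */
   return (size_t)((bits + k - 1u) / k);
}
```

For example, a 64-bit value needs 16 digits in base 16 and at most 20 in base 10, while a 10-bit value is reported as 4 decimal digits even though 512 only needs 3, which is the kind of one-digit overshoot discussed above.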

minad · 2019-10-02T19:59:07Z

Ok. 500 bytes are small enough I guess. What's the status?

minad · 2019-10-02T20:01:34Z

We could add a configuration option? MP_RADIX_ALL_BASES? Otherwise only enable 10 and power of 2?

czurnieden · 2019-10-02T21:19:35Z

What's the status?

We are only waiting for your OK, so if you are OK with it, drop me a note and I will squash&finish.

We could add a configuration option? MP_RADIX_ALL_BASES? Otherwise only enable 10 and power of 2?

So no "squash&finish" here?
But seriously: what kind of configuration, just the macro MP_RADIX_ALL_BASES or something a bit more sophisticated (whatever that might be)? I haven't implemented the only base 10 and powers of 2 yet, please give me a day (which means "wee hours of the night" in my case ;-) ) to do so.

czurnieden · 2019-10-02T22:22:51Z

@minad just a quick hack as an example. Do you want something like that?

minad · 2019-10-03T05:36:10Z

Hmm I am not sure if I like further complications of this function

czurnieden · 2019-10-03T16:08:19Z

We could add a configuration option? MP_RADIX_ALL_BASES? Otherwise only enable 10 and power of 2?

just a quick hack as an example

Hmm I am not sure if I like further complications of this function

Uhm…ooookaaaay?

So, can I take that as the answer "No." to my question

Do you want something like that?

?

minad · 2019-10-03T18:03:09Z

@sjaeckel @czurnieden What do you think? Is this ready or push this after 1.2 since it is a strict addition? I would love it if we could get it out sooner rather than later since there are already so many changes. After that we can get rid of the deprecated stuff hopefully. 2.0 can then take a while to mature before release, including additions like this or a faster mp_to_radix. Since 1.2 is more like a backward compatibility release I think that would be fine.

czurnieden · 2019-10-03T18:18:27Z

Is this ready or push this after 1.2 since it is a strict addition?

It was ready for 1.2 but isn't anymore with the new branch MP_RADIX_ALL_BASES. We need to either define MP_RADIX_ALL_BASES by default, invert the branch with MP_RADIX_REDUCED_BASES, or adapt mp_to_radix, too (a bit more work but not that much).

But the biggest problem is the reduction of the functionality itself. Not much has been changed up to commit 724db0a that couldn't get squeezed into 1.2 but reducing functionality is a bit much. Either invert the logic (MP_RADIX_REDUCED_BASES) or roll back to 724db0a for 1.2
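A sketch of the inverted logic, to make the difference concrete: with an opt-out macro, a default build keeps every base and only builds that explicitly define MP_RADIX_REDUCED_BASES lose functionality. The function name is hypothetical and only illustrates the preprocessor switch; it is not part of the library.

```c
#include <stdbool.h>

/* Hypothetical check, not code from this PR.
 * Default build: every radix in 2..64 is accepted.
 * With -DMP_RADIX_REDUCED_BASES: only 10 and the powers of two remain. */
static bool radix_is_supported(int radix)
{
   if ((radix < 2) || (radix > 64)) {
      return false;
   }
#ifdef MP_RADIX_REDUCED_BASES
   return (radix == 10) || ((radix & (radix - 1)) == 0);
#else
   return true;
#endif
}
```

With the opt-in variant (MP_RADIX_ALL_BASES) the `#ifdef` would be reversed, and an existing user who does not define anything would suddenly get the reduced set, which is the compatibility problem for 1.2 described above.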

minad · 2019-10-03T18:23:49Z

MP_RADIX_BASES was just me thinking out loud. As I said, I am not sure if we want that due to the complications it brings. But we can still discuss all that.

My point was more to get 1.2 out at some point since I am getting worried with all the changes (API types, deprecations etc etc). 2.0 would give us a bit of a clean slate (think size_t issues, 8bit gone etc).

sjaeckel

looks good IMO after those minor nitpicks

sjaeckel · 2019-10-07T11:04:45Z