
MAINT: Use expm1(x) instead of exp(x) - 1 #15346

Closed
wants to merge 5 commits

Conversation

gaurav1086

>>> np.exp(1e-99) - 1
0.0
>>> np.expm1(1e-99)
1e-99

@eric-wieser
Member

@bashtage, can you comment on stream compatibility here?

@bashtage
Contributor

Good idea in distributions, but legacy should not change unless something isn't compiling. It doesn't really affect the stream except that in very rare cases one might see a different number. I'm not sure that standard exponential would ever produce a value where the precision difference would appear, since there are only roughly 2**53 distinct values possible.

@charris changed the title from "Use expm1(x) instead of exp(x) - 1 for precision" to "MAINT: Use expm1(x) instead of exp(x) - 1" on Jan 19, 2020
@bashtage
Contributor

Are there numerical differences at reasonable sample sizes? As in, are any different values actually generated?

@miccoli
Contributor

miccoli commented Jan 19, 2020

Doesn't really affect the stream except that in very rare cases one might see a different number. I'm not sure that standard exponential would ever produce a value where the precision difference would appear

I do not fully understand what this means, but in double precision, almost 50% of the samples drawn by pareto(a=1.1) would differ by 1 ULP or more, with respect to the old implementation.
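
That estimate can be reproduced with a short script along these lines (a sketch, assuming the Pareto variate is generated as exp(E/a) - 1 from a standard-exponential draw E, which is the transform this PR replaces with expm1(E/a)):

import numpy as np

rng = np.random.default_rng(12345)  # seed chosen arbitrarily
a = 1.1
e = rng.standard_exponential(1_000_000)  # E ~ Exp(1)

old = np.exp(e / a) - 1.0  # current formulation
new = np.expm1(e / a)      # proposed formulation

# Fraction of samples that change by at least 1 ULP (i.e. are not bit-equal).
print(np.mean(old != new))  # miccoli reports this is close to 50% for a = 1.1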

@bashtage
Contributor

I do not fully understand what this means, but in double precision, almost 50% of the samples drawn by pareto(a=1.1) would differ by 1 ULP or more, with respect to the old implementation.

I didn't have any idea of which values make expm1(x) and exp(x) - 1 differ. The example at the top (1e-99) doesn't seem useful for random pareto, although it could matter for large values of a. I did a quick check, and for a=1 the difference is there, so this seems like a good idea to me in distributions (but not for legacy_distributions).
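
A minimal way to see where the two expressions actually disagree (a sketch, just scanning a few magnitudes of x):

import numpy as np

x = np.logspace(-20, 2, 12)
old = np.exp(x) - 1.0
new = np.expm1(x)
rel = np.abs(old - new) / new  # new > 0 for x > 0, so this is safe
for xi, ri in zip(x, rel):
    print(f"x = {xi:9.2e}   relative difference = {ri:.2e}")

# The cancellation in exp(x) - 1 dominates for small x and fades out as x
# grows. For a Pareto draw the argument is x = E / a, so small x corresponds
# to a large shape parameter a or a small exponential variate E.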

@seberg
Member

seberg commented Jan 21, 2020

@gaurav1086 would it be possible to add a test that would notice the loss of numerical precision if we changed things back?

@gaurav1086
Author

@seberg, sure, I think that would be a good idea.

@seberg
Member

seberg commented Jan 28, 2020

@gaurav1086 could you add the small test? Would be nice to finish this up, since it is an obvious improvement. Let me know if you need pointers for doing that.

@gaurav1086
Author

@seberg, sorry for the delay. Will add it today. Thank you.

@gaurav1086
Author

@seberg, added the test in numpy/numpy/random/tests/test_random.py:

def test_pareto_expm1(self):
    assert_(np.random.pareto(1e99) > 0.0)

Currently,

>>> np.random.pareto(1e99)
0.0

After the change, it should be non-zero but of very small magnitude. Please correct me if I am wrong. Thank you.
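
The underflow can be reproduced directly from the transform (a sketch; the exact exponential variate drawn internally will of course differ):

import numpy as np

# pareto draws E ~ Exp(1) and returns exp(E / a) - 1 (legacy) or
# expm1(E / a) (this PR). With a = 1e99 the argument E / a is around 1e-99.
e_over_a = 1.0 / 1e99           # a representative value of E / a
print(np.exp(e_over_a) - 1.0)   # 0.0    -- exp(1e-99) rounds to exactly 1.0
print(np.expm1(e_over_a))       # 1e-99  -- strictly positive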

@seberg
Member

seberg commented Jan 30, 2020

@gaurav1086 thanks, the test actually fails on master. On the upside, it is supposed to fail, since np.random.pareto is the legacy implementation, which you hopefully did not change. You should test rng = np.random.default_rng() (or whatever is commonly used in the tests; it may be a different test file).
Also make sure that the test can never fail spuriously: if there is a chance that 1e99 actually does return 0, please set a seed.
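
Something along these lines would exercise the new code path (a sketch of the suggestion above; the test name, file, and seed are placeholders):

import numpy as np
from numpy.testing import assert_

def test_pareto_expm1():
    # Use the Generator API, which goes through the updated distributions
    # code, rather than the legacy np.random.pareto. The fixed seed keeps
    # the draw deterministic so the assertion cannot fail intermittently.
    rng = np.random.default_rng(1234)
    assert_(rng.pareto(1e99) > 0.0)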

@gaurav1086
Author

@seberg, I changed random_pareto() in distributions.c. The legacy path was a separate change, which I reverted.

@seberg
Member

seberg commented Feb 3, 2020

@gaurav1086 yes, but your test tests the legacy one, so it fails. Can you fix the test?

@mattip
Member

mattip commented Feb 6, 2020

You should try running tests locally before pushing. Your fix does not work:
SeedSequence expects int or sequence of ints for entropy not 1e+99
You can run tests locally with:

python3 -mvenv /tmp/py_to_test
/tmp/py_to_test/bin/python -mpip install -r test_requirements.txt
/tmp/py_to_test/bin/python runtests.py

@gaurav1086 closed this Feb 17, 2020