Handle the limit of p = 0 in p log2 p #3

jftsang · 2021-06-14T14:28:47Z

This patch defines a helper function, _xlog2x(x), that calculates
x * log2(x) but handles the case x == 0 by returning 0 rather than nan.
This is needed if the power spectrum has any component that is exactly
zero: in particular, if the f = 0 component is zero.

This patch defines a helper function, _xlog2x(x), that calculates x * log2(x) but handles the case x == 0 by returning 0 rather than nan. This is needed if the power spectrum has any component that is exactly zero: in particular, if the f = 0 component is zero.

raphaelvallat · 2021-06-15T21:52:24Z

Hi @jftsang,

Thanks for the PR! A few questions:

Do you have any scientific reference (or any kind of documentation) for why this should be the preferred behavior?
Should this modification only affect the spectral entropy function?
Could you explain what the @np.vectorize decorator is for?

Thanks,
Raphael

jftsang · 2021-06-16T14:25:39Z

Hi @raphaelvallat,

The limit of x log x as x tends to 0 is 0; it follows from l'Hopital's rule. Try https://math.stackexchange.com/questions/470952/limit-of-x-log-x-as-x-tends-to-0, and see this illustration: https://www.wolframalpha.com/input/?i=limit+of+x*log%28x%29+as+x+-%3E+0. I'll try and find a proper academic reference when I get home.
I don't know much about the other entropies but I think this result applies whenever a p log p appears, so that your entropy is zero and not undefined.
The @np.vectorize decorator allows you to apply the function to a numpy array rather than a single number. We need this because of the if x == 0. Without the decorator, 'the truth value of an array with more than one element is ambiguous'.

Cheers,
Joanna

antropy/entropy.py

raphaelvallat · 2021-06-16T22:45:21Z

This is all perfect, thanks! One last thing before I merge: can you add your changes to the docs/changelog.rst file (with link to the current PR and if desired your GitHub username)? You'll need to start a new version of antropy, i.e. v0.1.5

Cheers,
Raphael

Pull request raphaelvallat#3

jftsang · 2021-06-17T09:43:16Z

Done! I've also added a couple of unit tests.

~JMFT

raphaelvallat · 2021-06-17T15:20:15Z

Merging now, thanks again for the PR!

jftsang · 2021-06-24T10:24:30Z

Having tested this on a very large file, I have just realised that this _xlog2x function is significantly slower, I suspect because of the conditional test. I shall experiment with using np.nan_to_num instead, which I suspect will be much faster. Sorry for the inconvenience!

Follow up to raphaelvallat#3 Using np.nan_to_num is advantageous because it makes use of numpy's vectorization, instead of 'if x == 0', which applies the test pointwise.

Follow up to raphaelvallat#3. Using `np.where` is advantageous because it makes use of numpy's vectorization, instead of `if x == 0`, which applies the test pointwise. Using `@jit(nopython=True)` is also advantageous.

raphaelvallat self-requested a review June 15, 2021 21:40

raphaelvallat self-assigned this Jun 15, 2021

jftsang commented Jun 16, 2021

View reviewed changes

antropy/entropy.py Show resolved Hide resolved

jftsang commented Jun 16, 2021

View reviewed changes

antropy/entropy.py Show resolved Hide resolved

jftsang added 2 commits June 17, 2021 10:31

Update changelog for pull request raphaelvallat#3

4ee2f0a

Unit tests for _xlog2x

c85d731

Pull request raphaelvallat#3

raphaelvallat approved these changes Jun 17, 2021

View reviewed changes

raphaelvallat merged commit 4ba03dc into raphaelvallat:master Jun 17, 2021

jftsang mentioned this pull request Jun 24, 2021

Improve performance in _xlog2x #8

Merged

raphaelvallat mentioned this pull request Sep 20, 2021

RuntimeWarning in _xlogx when x has zero values #10

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle the limit of p = 0 in p log2 p #3

Handle the limit of p = 0 in p log2 p #3

jftsang commented Jun 14, 2021

raphaelvallat commented Jun 15, 2021

jftsang commented Jun 16, 2021 •

edited

raphaelvallat commented Jun 16, 2021

jftsang commented Jun 17, 2021

raphaelvallat commented Jun 17, 2021

jftsang commented Jun 24, 2021

Handle the limit of p = 0 in p log2 p #3

Handle the limit of p = 0 in p log2 p #3

Conversation

jftsang commented Jun 14, 2021

raphaelvallat commented Jun 15, 2021

jftsang commented Jun 16, 2021 • edited

raphaelvallat commented Jun 16, 2021

jftsang commented Jun 17, 2021

raphaelvallat commented Jun 17, 2021

jftsang commented Jun 24, 2021

jftsang commented Jun 16, 2021 •

edited