
Address numpy deprecation warning #506

Closed
wants to merge 3 commits

Conversation

OriolAbril
Contributor

I am seeing:

DeprecationWarning: Conversion of an array with ndim > 0 to a scalar is deprecated, and will error in future. Ensure you extract a single element from your array before performing this operation. (Deprecated NumPy 1.25.)

coming from these lines; I think this change would fix them. I'm not really sure how to add a test for that, or whether you'd prefer a different approach.
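
For context, the warning comes from implicitly converting an array with ndim > 0 (even a one-element one) to a Python scalar, which NumPy 1.25 deprecated. A minimal sketch, separate from the PR itself, of what NumPy now complains about:

import warnings
import numpy as np

warnings.simplefilter("always")
x = np.array([1.0])  # shape (1,), ndim == 1
float(x)             # DeprecationWarning under NumPy >= 1.25
x.item()             # fine: explicitly extracts the single element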

@dfm
Owner

dfm commented Mar 28, 2024

Thanks! Can you share a code snippet and all the relevant versions where you're seeing this? I can't reproduce this warning.

@OriolAbril
Contributor Author

OriolAbril commented Mar 28, 2024

I think the default is to not show DeprecationWarnings to users. Try setting:

import warnings
warnings.simplefilter('always')  # show all warnings, including DeprecationWarning

I see it on anything with blobs. I haven't checked multiple versions; I am on Python 3.10, NumPy 1.26.4, and emcee 3.1.4.

Some example snippets:

import numpy as np
import emcee

# single blob
sampler = emcee.EnsembleSampler(6, 1, lambda x: (-(x**2), 3))
sampler.run_mcmc(np.random.normal(size=(6, 1)), 20)

# multiple blobs
sampler = emcee.EnsembleSampler(6, 1, lambda x: (-(x**2), (np.random.normal(x), 3)))
sampler.run_mcmc(np.random.normal(size=(6, 1)), 20)

@dfm
Owner

dfm commented Apr 2, 2024

The problem here is really that there is a bug in the log probability that you're specifying. The log probability should always be a scalar, but in this case it has size ndim. The "correct" log prob is probably something like:

- lambda x: (-(x**2), 3)
+ lambda x: (-np.sum(x**2), 3)

I'd actually probably rather raise an exception for non-scalar probability, rather than trying to coerce it into a scalar. I expect that this is a fairly small corner case (log probs that have shape (1,) instead of ())...

That being said, relying on numpy's deprecation for this doesn't seem like a good approach either! Would you be willing to update this PR to explicitly check that the log prob is a scalar?

@OriolAbril
Contributor Author

Sounds good. I thought (1,) arrays and scalars were interchangeable; I'll fix my code too. Do you have a preferred way to check that something is a scalar? np.ndim(x) == 0?

@dfm
Owner

dfm commented Apr 2, 2024

I guess they used to be interchangeable, but if numpy is deprecating this behavior, it might be best for emcee to follow suit!

> Do you have a preferred way to check something is a scalar? np.ndim(x) == 0?

This seems sensible to me. Thank you!!
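
A rough sketch of the kind of explicit check being discussed, with a hypothetical helper name (this is not the code that was merged into emcee):

import numpy as np

def check_scalar_log_prob(log_prob):
    # Hypothetical helper: reject non-scalar log probabilities
    # instead of silently coercing them to a scalar.
    if np.ndim(log_prob) != 0:
        raise ValueError(
            "The log probability function must return a scalar; "
            f"got an array with shape {np.shape(log_prob)} instead."
        )
    return float(log_prob)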

@andyfaff
Contributor

andyfaff commented Apr 2, 2024

In scipy we come across this rather more often than I'd like, with a lot of jumping through hoops. Anyway, for a multivariate function that should return a scalar, we deal with it as:

fx = fun(np.copy(x), *args)
# Make sure the function returns a true scalar
if not np.isscalar(fx):
    try:
        # deals with array-like returns, e.g. np.array(1.0) or np.array([1.0])
        fx = np.asarray(fx).item()
    except (TypeError, ValueError) as e:
        raise ValueError(
            "The user-provided objective function "
            "must return a scalar value."
        ) from e
return fx

It should deal with 1.0, np.double(1.0), np.array(1.0), np.array([1.0])
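
For illustration, a quick check (my own snippet, not from scipy) that the coercion handles each of those cases without a warning:

import numpy as np

for fx in (1.0, np.double(1.0), np.array(1.0), np.array([1.0])):
    if not np.isscalar(fx):
        fx = np.asarray(fx).item()
    print(type(fx).__name__, fx)  # a scalar float (or float64) equal to 1.0 in every case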

@OriolAbril
Contributor Author

I understood the goal wasn't to deal with np.array([1.0]) but to raise an error when it is encountered, hence the change from the initial use of item to ndim and explicit error raising.

@andyfaff
Contributor

andyfaff commented Apr 3, 2024

My comment was from the sidelines: it shows what other projects do in a similar situation, not necessarily what should be done here.

@dfm
Owner

dfm commented Apr 3, 2024

Thanks @andyfaff!! This is very useful context.

It's interesting that scipy has decided to handle the deprecation this way, and it definitely makes me less certain about how we want to navigate this. The original motivation for emcee was to be roughly compatible with scipy's optimize interface, so maybe it is right to match that.

Another more radical approach would be to explicitly sum the log prob result down to a scalar, but that would be a bigger change to the expected interface.

In the end I guess I don't have a strong opinion so I'm happy with either raising the error or using the scipy approach that @andyfaff shared above. What do you think @OriolAbril?

Thanks both!!

@andyfaff
Contributor

andyfaff commented Apr 5, 2024

I've dealt with this issue, and added unit tests as part of #510. I went with the fix that I suggested.

As mentioned above, it will deal with 1.0, np.float64(1.0), np.array(1.0), and np.array([1.0]). The latter is quite common in user-land; see #509.

@OriolAbril
Contributor Author

> I've dealt with this issue, and added unit tests as part of #510. I went with the fix that I suggested.

Closing this, then.

@OriolAbril OriolAbril closed this Apr 5, 2024
@OriolAbril OriolAbril deleted the patch-1 branch April 5, 2024 16:58
@dfm
Owner

dfm commented Apr 5, 2024

Thanks, @OriolAbril! And thanks again for tracking this down. I'm happy with where this landed in the end.
