Add convenience functions nans, infs, nans_like, infs_like #2875

ahojnnes · 2013-01-01T09:07:51Z

I find myself often in the situation where I type the long version of this:

b = np.empty_like(a, dtype=np.double)
b[:] = np.inf

Please let me know if you find this useful as well and I will add test cases for those functions.

njsmith · 2013-01-02T14:20:33Z

Hmm, this is creating something of a proliferation of different functions. I think the usual way to do this is to just write

b = np.ones_like(a, dtype=np.double) * np.inf

but obviously that is substantially less efficient than the multi-line version you suggested (it creates and initializes two separate arrays).

Maybe we should add np.filled and np.filled_like functions instead, which take an argument specifying the fill value? Still not entirely sure whether it'd be worth adding to the numpy api, but it does seem like a useful convenience and less conceptually cluttered than the current patch. Possibly we could also simplify numpy's code by reimplementing zeros, zeros_like, etc., in terms of these new functions.

ewmoore · 2013-01-05T21:00:01Z

I think that this is unnecessary. Adding np.filled or np.filled_like would be better. I think the idiomatic way to do this right now is:
b = np.empty_like(a, dtype=np.double).fill(np.inf)
This is substantially faster than the technique shown above.

In [11]: timeit np.empty(4096).fill(np.inf)
10000 loops, best of 3: 29.2 us per loop

In [12]: timeit np.ones(4096) * np.inf
1000 loops, best of 3: 1.08 ms per loop

njsmith · 2013-01-05T21:36:29Z

That can't be the idiomatic way, b/c .fill doesn't return self :-)

In [10]: b = np.empty_like(a, dtype=np.double).fill(np.inf)

In [11]: b is None
True

So I think right now the * is the only single-statement way to do this,
even though it's dumb and slow.

I guess I'm +0.5, maybe even +1, on np.filled/np.filled_like.

On Sat, Jan 5, 2013 at 9:00 PM, Eric Moore notifications@github.com wrote:

I think that this is unnecessary. Adding np.filled or np.filled_likewould be better. I think the idiomatic way to do this right now is:
b = np.empty_like(a, dtype=np.double).fill(np.inf)
This is substantially faster than the technique shown above.

In [11]: timeit np.empty(4096).fill(np.inf)
10000 loops, best of 3: 29.2 us per loop

In [12]: timeit np.ones(4096) * np.inf
1000 loops, best of 3: 1.08 ms per loop

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/2875#issuecomment-11919794.

ewmoore · 2013-01-05T23:15:03Z

Touche. That said, my point stands. Using a multiply to fill an array is a terrible idea. Even if the better way takes two lines rather than one.

b = np.empty_like(a, dtype=np,double)
b.fill(np.inf)

Given this I think filled/filled_like are an okay idea.

ahojnnes · 2013-01-06T13:58:18Z

I agree, filled and filled_like are the best option. Give me some days and I will implement those two functions as part of this PR.

ahojnnes · 2013-01-13T12:02:34Z

filled and filled_like including tests added.

njsmith · 2013-01-13T17:19:05Z

I see tests for filled_like, but not for filled. Am I missing something, or do you need to add more tests? :-)

ahojnnes · 2013-01-13T18:34:31Z

Yes, I forgot about that because I couldn't find tests for ones and zeros. Do you know where I can find those? If those do not yet exist, I could implement those as well, of course.

njsmith · 2013-01-13T18:47:30Z

I'm not finding any tests for the basic ones/zeros functionality either.
They get some coverage from being used in other tests, but it's probably
not thorough WRT things like different shapes, dtypes, orders, etc. If
you'd like to add tests for all of them at once that would be great :-)

(And it's possible that we're both just missing them, but a little
duplication in the test suite is far better than having nothing in the test
suite!)

On Sun, Jan 13, 2013 at 6:34 PM, Johannes Schönberger <
notifications@github.com> wrote:

Yes, I forgot about that because I couldn't find tests for ones and zeros.
Do you know where I can find those? If those do not yet exist, I could
implement those as well, of course.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/2875#issuecomment-12197751.

ahojnnes · 2013-01-13T19:50:41Z

Added test cases.

njsmith · 2013-01-13T21:43:24Z

TestCreationFuncs is failing on Python 3: https://travis-ci.org/numpy/numpy/builds/4128136

ahojnnes · 2013-01-14T14:55:33Z

I don't have a running Python 3 environment here, so I skip all string/unicode comparisons. Maybe someone else can have a look into this. Maybe this is a bug?

seberg · 2013-01-14T15:14:10Z

The difference is:

Python 2.7:

In [8]: np.dtype('S1').type(0)
Out[8]: '0'

In [9]: np.dtype('S0')
Out[9]: dtype('S')

Python 3.2:

In [8]: np.dtype('S1').type(0)
Out[8]: b''

In [9]: np.dtype('S0')
TypeError: data type not understood

The second difference does not really matter much I believe, 'S0' is not really a sensible type anyway. But the first one is an inconsistency between the two and I believe as such a bug in python 3 string handling.

ahojnnes · 2013-01-17T17:10:24Z

Maybe someone with more knowledge regarding the specifics of Python 3 string handling can have another look at this and tell what he/she thinks?

ahojnnes · 2013-03-14T14:18:50Z

Has someone had time to look into this yet? If not I'll tackle this in the coming days.

charris · 2013-05-04T17:20:09Z

In Python3.3:

>>> np.dtype('S1').type(0)
b''
>>> np.dtype('S0')
dtype('S')

So the problem looks to be only with 3.2. Now that we are not supporting Python < 2.6 the b prefix can be used, in Python2 it just gets ignored.

charris · 2013-05-04T17:20:46Z

@seberg Looks like you thought this was good apart from the errors.

ahojnnes · 2013-05-04T19:36:02Z

Rebased on current master and fixed the Python 3 test cases.

Tell me if there is anything else you need me to do.

charris · 2013-05-04T20:31:14Z

Don't know what's going on with the Travis bot notification here, all the tests seem to have passed.

The new functions need to be mentioned in doc/release/1.8.0-notes.rst.

charris · 2013-05-04T20:36:27Z

numpy/core/numeric.py

+
+    Please refer to the documentation for `zeros` for further details.
+
+    Other parameters


I'd just copy the documentation from zeros, no point in sending people around the 'see blah' loop. I think the main justification for that practice was keeping things in sync but I don't think the documentation gets updated that often.

Should I also do it for ones, … etc.?

ahojnnes · 2013-05-06T19:07:55Z

@charris Doc strings updated with separate parameter description and short example sections.

ahojnnes · 2013-05-06T20:09:00Z

I guess all remaining tasks should be addressed now. Please, check if the release notes are appropriate?

njsmith · 2013-05-06T20:20:37Z

Looks good to me, but one more subtle API question: should the default dtype for filled be np.float64, or np.array(fill_value).dtype?

ahojnnes · 2013-05-06T20:51:50Z

That's actually a good question. On the one hand np.float64 is more consistent with the existing zeros etc. functions, on the other hand np.array(fill_value).dtype is definitely applicable to more types of input.

I'm +1 on changing it to np.array(fill_value).dtype. Any other opinions?

ahojnnes · 2013-05-20T10:47:47Z

I checked the behavior of the current implementation and it is already the case that np.array(fill_value).dtype is used the dtype for the filled function.

njsmith · 2013-05-22T09:31:28Z

I like np.array(fill_value).dtype better too.

The docs currently say that filled uses float64 by default, so that would still need to be changed.

You have a merge conflict that needs resolving.

ahojnnes · 2013-05-24T17:14:47Z

Doc string updated and rebased on current state of master branch.

ahojnnes · 2013-06-06T17:21:22Z

/ping

charris · 2013-06-06T17:27:46Z

Needs a rebase,

ahojnnes · 2013-06-06T19:16:16Z

@charris Rebased on current master.

ahojnnes · 2013-06-11T17:44:15Z

/ping

charris · 2013-06-12T02:10:12Z

@njsmith Is this ready?

njsmith · 2013-06-12T11:57:25Z

Sigh.

The code looks great to me and I like the API. But I just went back and looked at the mailing list discussion:
~~http://thread.gmane.org/gmane.comp.python.numeric.general/52763~~ archives link

We're going to have a mess if we just declare this done and merge it as is, because now that I remind myself, the general opinion in that thread seems to have been:

You can't possibly call this thing filled because that conflicts with np.ma.filled. But there are no better names.
Therefore the best solution is to call it is np.empty(shape, fill=value) instead.

My personal opinion is that this argument is completely wrong. Making np.ma's API clean is not as important as making numpy proper's API clean, esp. since np.ma already is already inconsistent with the word 'fill' being used for totally different things. So we can just live with the inconsistency between np.filled and np.ma.filled, or can deprecate np.ma.filled and tell people to use the method version instead (which already exists). And the np.empty idea is just horribly ugly, will confuse newbies and everyone else forever, and I'm 100% convinced we would regret it.

But, just ignoring the discussion and merging anyway will do more harm than good, even if everyone does eventually agree the PR is totally awesome...

I guess there's not really anything we can do except send another email and re-open the debate.

Sigh.

ahojnnes · 2013-06-12T13:42:06Z

If naming is an issue, how about np.vals(…) and np.vals_like(…)?

njsmith · 2013-06-12T13:50:33Z

I think that suggestion did come up in the last thread, though I don't
remember the conclusion off-hand.

New thread: ~~http://thread.gmane.org/gmane.comp.python.numeric.general/54467~~ archive link

On Wed, Jun 12, 2013 at 2:42 PM, Johannes Schönberger <
notifications@github.com> wrote:

If naming is an issue, how about np.vals(…) and np.vals_like(…)?

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/2875#issuecomment-19325972
.

ahojnnes · 2013-06-12T13:50:54Z

Nevertheless, I can bring up this topic on the mailing list again.

njsmith · 2013-06-12T13:56:34Z

Might be better to wait a day or so to see if the current 'filled' names go
through this time and only bring that up if not... thread length tends to
be exponential in the number of ideas suggested, and probability of
successful resolution is inversely proportional to thread length.

On Wed, Jun 12, 2013 at 2:50 PM, Johannes Schönberger <
notifications@github.com> wrote:

Nevertheless, I can bring up this topic on the mailing list again.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/2875#issuecomment-19326572
.

ahojnnes · 2013-06-18T18:46:08Z

@njsmith Sorry, it is hard for me as an outside person to follow the discussion, but we do not seem to approach any consensus as far as I can see, do we? ;-)

njsmith · 2013-06-30T10:18:51Z

@ahojnnes - Phew. At long last, it looks like people are all okay with full and full_like -- want to do a quick search/replace and get this merged before someone changes their mind? :-)

(Sorry this has been such a hassle. Thanks for sticking with it so far!)

ahojnnes · 2013-06-30T10:34:00Z

@njsmith No problem. Now, we only need Travis to be happy with the changes... ;-)

Add convenience functions nans, infs, nans_like, infs_like

charris reviewed May 4, 2013
View reviewed changes

Add nans, infs, nans_like, infs_like convenience functions

87d884e

ahojnnes and others added 14 commits June 6, 2013 21:15

Add tests for filled_like function

474ec48

Use commin method to compare array values

a677232

Use more idiomatic way of None-check

587b092

Add filled_like to doc string of test class

5be86a8

Add tests for zeros, ones, empty and filled

5991bbe

Comment test cases

64d236c

Add doc string to creation test class

781cb48

Do not compare string or unicode values

dbc3558

Fix Python 3 test cases for filled* functions

8ed3733

Add separate parameter description to filled, filled_like and ones

494fa21

Add examples to doc string of filled and filled_like

91b1b99

Rename fill value parameter for consistency across numpy+

8b6ccd9

Add filled and filled_like to changelog of 1.8.0

b4b20dc

Update default dtype of filled function in doc string

7759766

Rename filled, filled_like to full, full_like

70cb9e5

njsmith added a commit that referenced this pull request Jun 30, 2013

Merge pull request #2875 from ahojnnes/array-init

29dcc54

Add convenience functions nans, infs, nans_like, infs_like

njsmith merged commit 29dcc54 into numpy:master Jun 30, 2013

anntzer mentioned this pull request Sep 27, 2015

np.full and object arrays #6366

Closed

WarrenWeckesser mentioned this pull request Jul 31, 2024

full and full_like permit out-of-bounds Python integers #27075

Closed


		Please refer to the documentation for `zeros` for further details.

		Other parameters

Add convenience functions nans, infs, nans_like, infs_like #2875

Add convenience functions nans, infs, nans_like, infs_like #2875

Conversation

ahojnnes commented Jan 1, 2013

njsmith commented Jan 2, 2013

ewmoore commented Jan 5, 2013

njsmith commented Jan 5, 2013

ewmoore commented Jan 5, 2013

ahojnnes commented Jan 6, 2013

ahojnnes commented Jan 13, 2013

njsmith commented Jan 13, 2013

ahojnnes commented Jan 13, 2013

njsmith commented Jan 13, 2013

ahojnnes commented Jan 13, 2013

njsmith commented Jan 13, 2013

ahojnnes commented Jan 14, 2013

seberg commented Jan 14, 2013

ahojnnes commented Jan 17, 2013

ahojnnes commented Mar 14, 2013

charris commented May 4, 2013

charris commented May 4, 2013

ahojnnes commented May 4, 2013

charris commented May 4, 2013

charris May 4, 2013

Choose a reason for hiding this comment

ahojnnes May 5, 2013

Choose a reason for hiding this comment

ahojnnes commented May 6, 2013

ahojnnes commented May 6, 2013

njsmith commented May 6, 2013

ahojnnes commented May 6, 2013

ahojnnes commented May 20, 2013

njsmith commented May 22, 2013

ahojnnes commented May 24, 2013

ahojnnes commented Jun 6, 2013

charris commented Jun 6, 2013

ahojnnes commented Jun 6, 2013

ahojnnes commented Jun 11, 2013

charris commented Jun 12, 2013

njsmith commented Jun 12, 2013 • edited by WarrenWeckesser Loading

ahojnnes commented Jun 12, 2013

njsmith commented Jun 12, 2013 • edited by WarrenWeckesser Loading

ahojnnes commented Jun 12, 2013

njsmith commented Jun 12, 2013

ahojnnes commented Jun 18, 2013

njsmith commented Jun 30, 2013

ahojnnes commented Jun 30, 2013

njsmith commented Jun 12, 2013 •

edited by WarrenWeckesser

Loading

njsmith commented Jun 12, 2013 •

edited by WarrenWeckesser

Loading