Apply NumPy's random functions using awkward inputs? #489

Duchstf · 2020-10-17T20:25:58Z

Hello,

Thanks for the excellent work! I'm working on an application where I want to apply np.random.normal to each element of an Awkward Array. I'm trying to do the following:

a = ak.from_iter([[8],[7],[9,11],[5]])
f = lambda x : np.random.normal(x,x*0.15)
# f(a) gives errors
f_arr = np.frompyfunc(f, 1, 1) # try making ufunc
# f_arr(a) also gives errors

I'm wondering if anyone here has suggestions as to what I should do in this case. 😄 Our solution right now is basically to use nested loops to change each element, which is not very efficient.

jpivarski · 2020-10-17T20:56:00Z

The reason it's not working is that there isn't an Awkward function overriding this NumPy function. That would be a good feature to add (I hadn't thought of it), and it would be a whole new category of overload (it doesn't quite belong in ak.operations.structure, though it would be implemented in a similar way).

For the time being, I guess you'd have to unwrap the a object manually with a.layout.content until you get the underlying NumpyArray, convert it with np.asarray, compute the random numbers, and then wrap it up as a ListOffsetArray64 using the same offsets as the original, and then put that in a new ak.Array. That's essentially what a built-in function would do, but for general structures, and it's also why such a function would be a nice addition.
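
The role of the offsets in that procedure can be illustrated with plain NumPy, without any Awkward classes. This is only a sketch: the content and offsets arrays below are written out by hand to match the example [[8], [7], [9, 11], [5]], and the seeded generator is an assumption made for reproducibility, not something the thread prescribes.

```python
import numpy as np

# Flat content and offsets equivalent to [[8], [7], [9, 11], [5]] (written by hand).
content = np.array([8.0, 7.0, 9.0, 11.0, 5.0])
offsets = np.array([0, 1, 2, 4, 5])

# Seeded only so that the sketch is reproducible (an assumption, not a requirement).
rng = np.random.default_rng(seed=0)

# One vectorized call over the flat content: mean = value, sigma = 15% of the value.
smeared = rng.normal(content, content * 0.15)

# Re-apply the list structure: list i spans smeared[offsets[i]:offsets[i + 1]].
nested = [
    smeared[start:stop].tolist()
    for start, stop in zip(offsets[:-1], offsets[1:])
]
print(nested)
```

The random numbers are computed once on the flat array, and the offsets are reused unchanged, which is why the result has the same list lengths as the input.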

Duchstf · 2020-10-17T21:22:48Z

Thank you! There is something I want to clarify:

and then wrap it up as a ListOffsetArray64 using the same offsets as the original, and then put that in a new ak.Array

How exactly can I do this? (Sorry I'm just starting to learn awkward). So suppose I did the following:

b = np.asarray(a.layout.content)
b_random = np.random.normal(b, b*0.15)

What should I do to convert b_random back to an awkward array with the same offsets as a.layout.offsets?

jpivarski · 2020-10-17T22:12:09Z

(I'm writing from a phone, so it's hard to give examples.)

The a.layout.offsets is one of the two arguments to the ak.layout.ListOffsetArray64 constructor (the other is the content), which puts the same list structure around your random numbers as the original list had. Passing the result to ak.Array then gives it the high-level interface of a (that is, of a itself, not a.layout). Try this in a terminal to see what I mean. The repr of these objects should help show what's going on; it's particularly instructive to practice on small objects and then scale up once you understand the structure.

Duchstf · 2020-10-17T22:44:25Z

Well then, I guess I would do this to make a new awkward array with the same offsets?

ak.Array(ak.layout.ListOffsetArray64(a.layout.offsets, ak.Array(b_random).layout))

Duchstf · 2020-10-17T22:58:37Z

This is my whole function. Would it be similar to what you have in mind for the future implementation?

def smear(arr):
    import numpy as np
    from awkward1 import Array
    from awkward1.layout import ListOffsetArray64

    # Convert the content to a 1D NumPy array and smear each value (sigma = 15% of the value)
    numpy_arr = np.asarray(arr.layout.content)
    smeared_arr = np.random.normal(numpy_arr, numpy_arr * 0.15)

    # Convert it back to awkward form, reusing the original offsets
    return Array(ListOffsetArray64(arr.layout.offsets, Array(smeared_arr).layout))

jpivarski · 2020-10-18T00:39:14Z

Your NumpyArray is unnecessarily wrapped (Array) and unwrapped (.layout), but otherwise, yes. Also, the general function would work for all data structures. (There's an internal ak._util.broadcast_and_apply that generalizes the process of unwrapping and re-wrapping. That's how all of the ufuncs work. But since it's an internal function and not a part of the stable API, you should stick with the technique you used here.)

Oh! I guess the reason you wrapped and unwrapped the NumpyArray is because you didn't know it was called that. :) You can use ak.layout.NumpyArray instead of Array and then .layout. The effect is the same, but it's more direct.

Duchstf · 2020-10-18T00:45:05Z

Thanks for your help!

jpivarski · 2020-10-18T13:48:40Z

I'm reopening this as a reminder to add the feature.

Duchstf · 2020-10-18T14:26:23Z

OK, and if you can point me to where to look, I'd be willing to make the PR for the feature too!

jpivarski · 2020-10-18T15:09:26Z

I'll want to start a new submodule for this, so it might be done before there's enough of a pattern to build on. However, I could start with the randomization functions and if there are any others you need, it should be clear how to build on that pattern.

jpivarski · 2022-04-15T19:49:31Z

Closing this one because it's a time-traveling duplicate.

Duchstf closed this as completed Oct 18, 2020

jpivarski changed the title ~~How to apply ufunc on awkward arrays?~~ Apply NumPy's random functions using awkward inputs? Oct 18, 2020

jpivarski added the feature New feature or request label Oct 18, 2020

jpivarski reopened this Oct 18, 2020

jpivarski mentioned this issue Oct 21, 2020

High-level function for selecting collections of different jaggedness; "broadcast-slice"? #492

Open

jpivarski mentioned this issue Jan 18, 2022

New random functions #1230

Open

jpivarski added the duplicate This issue or pull request already exists label Apr 15, 2022

jpivarski closed this as completed Apr 15, 2022
