Feature Request: Function as first-class type #3405

NicolasHug · 2018-10-12T19:50:20Z

I am using the latest released version of Numba

~ » python -c "import numba; print(numba.__version__)" 
0.40.0

I have included below a minimal working reproducer

~~I am getting a type unification error that I can't logically explain:~~

~~the same code works when I'm not inside a jitclass (see foo() vs C.foo())~~
~~even inside the jitclass, the unification can work under some conditions (see C.bar)~~

EDIT: as noticed by @ogrisel this isn't related to jitclass but more generally to the nopython mode. I updated the example and the log.

Minimal working example:

from numba import njit, jit, jitclass

@njit
def a():
    return 2

@njit
def b():
    return 3

def c(arg):  # works
    if arg:
        f = a
    else:
        f = b
    out = f()
    return out

@jit
def d(arg):  # works
    if arg:
        f = a
    else:
        f = b
    out = f()
    return out

@njit
def e(arg):  # works
    f = a
    f = b
    out = f()
    return out

@njit
def g(arg):  # fails
    if arg:
        f = a
    else:
        f = b
    out = f()
    return out

# OK
c(True)
c(False)

# OK
d(True)
d(False)

# OK
e(True)
e(False)

# KO
g(True)
g(False)

Error log:

Traceback (most recent call last):
  File "lol.py", line 57, in <module>
    g(True)
  File "/home/nico/.virtualenvs/pygbm/lib/python3.7/site-packages/numba/dispatcher.py", line 348, in _compile_for_args
    error_rewrite(e, 'typing')
  File "/home/nico/.virtualenvs/pygbm/lib/python3.7/site-packages/numba/dispatcher.py", line 315, in error_rewrite
    reraise(type(e), e, None)
  File "/home/nico/.virtualenvs/pygbm/lib/python3.7/site-packages/numba/six.py", line 658, in reraise
    raise value.with_traceback(tb)
numba.errors.TypingError: Failed in nopython mode pipeline (step: nopython frontend)
Cannot unify type(CPUDispatcher(<function a at 0x7fd67bd5af28>)) and type(CPUDispatcher(<function b at 0x7fd66bdb0378>)) for 'f', defined at lol.py (38)

File "lol.py", line 38:
def g(arg):  # fails
    <source elided>
    if arg:
        f = a
        ^

[1] During: typing of assignment at lol.py (40)

File "lol.py", line 40:
def g(arg):  # fails
    <source elided>
    else:
        f = b
        ^

The text was updated successfully, but these errors were encountered:

stuartarchibald · 2018-10-15T10:13:10Z

Thanks for the report. I can reproduce. I think this is because the type information about the result of calling member function foo cannot be discerned in the case with the branch.

ogrisel · 2018-10-15T14:51:47Z

Note that this is not specific to jitclass: if you decorate the foo function with an @njit decorator you get a similar error message.

NicolasHug · 2018-10-15T15:18:24Z

You're right, thanks, I updated the example

stuartarchibald · 2018-10-15T16:15:07Z

Thanks for the update, I think I see what's going on now. I thought that this was something more complicated to do with the jitclass internals but I believe in actual fact the issue is more simple. It is that when doing type inference Numba sees the passed in functions as unique types, at present it doesn't know about what the function itself returns, there is no "function type" type. As a result the type inference mechanism sees something equivalent in objectionableness to:

@njit
def g(arg):  # fails
    if arg:
        f = np.int32(a)
    else:
        f = np.array(10)
    out = f()
    return out

Moving this to a feature request, functions as first-class types.

luk-f-a · 2018-12-30T16:35:48Z

hi, two questions:

does this feature cover use cases as in using a list/tuple of jitted functions #2542?
where would this fall in the roadmap (https://numba.pydata.org/numba-doc/dev/developer/roadmap.html)? will it make it before v1.0?

thanks!

seibert · 2019-01-02T14:30:25Z

It would enable #2542, I believe.

As for roadmap, the soonest would be in the Medium Term bucket, probably post 1.0.

luk-f-a · 2019-01-02T14:45:26Z

thanks @seibert
For now I'm using a workaround based on sklam's "chain" suggestion.

seibert · 2019-03-31T16:45:00Z

@sklam: What do you think is the level of effort to make a function type that matches on type signature? (rather than what we have now, where every function is its own type)

sklam · 2019-04-01T17:05:27Z

I don't have a good estimate yet. But, I have included some notes on possible solutions to it.

Let's use the following code as an example:

def ex(f, g, choose, args0, args1):
    if choose:
        h = f         # pos A
    else:
        h = g         # pos B
    
    x = h(*args0)     # pos C
    y = h(*args1)     # pos D
    return x, y

Option 1. Continuing to allow FunctionType to reference the actual function objects.

Thinking through type inference:

At A, h is typed as FunctionType(f).
At B, h is unified to MultiFunctionType([f, g])
At C, the h(*args0) call causes type-check on f(*args0) and g(*args0).
At D, the h(*args1) call causes type-check on f(*args1) and g(*args1).

Thinking through lowering:

The call site at C and D may invoke different versions of h (and different versions of f and g).
At C and D, the call can be lowered into a switch that contains different call sequence to invoke f and g respectively.

Notes:

This approach is not scalable. The caller function must know all possible called functions and a new version must be created per new function added.

Option 2. Make functions runtime objects

Thinking through type inference:

At A, h is typed as FunctionType(f).
At B, h is unified to MultiFunctionType() (no reference to actual function object)
At C, the h(*args0) must be able to produce the result type of the call. Without a reference to an actual function object, we can't proceed.

(backing up....)

At A, h is typed as FunctionType(f).
At B, h is unified to MultiFunctionType(f) (with reference to the first object only)
At C, the h(*args0) uses f (the first seen function object) as a typing-template for all possible function value here. It will provide a signature, which we will call sig0.
At D, the h(*args1) produces a signature, sig1.

Whenever ex is called, arguments binding to f and g must type-check to be compatible to sig0 and sig1. If valid, a call-table is emitted for the respective signatures and a dispatcher object with that call-table is used as the runtime value.

Notes:

This option is more scalable. No recompile of ex is needed when binding to f and g changes as long as they are compatible to the required call-signatures.
But, if they are multiple sets of call-signatures, determining which set to use will become expensive.

luk-f-a · 2019-04-01T21:14:08Z

I'll go out on a limb here, as this is so far from my area of expertise. what about a more limited version that requires the functions to be set in advance?

I am thinking about sklam's chain in #2542 or this example, which is a modified version of that chain.

@nb.njit
def ident(out, x):
    pass

def chain_assign(fs, inner=ident):
    head, tail = fs[-1], fs[:-1]
    item = len(fs)-1
    
    @nb.njit
    def assign(out, x):
        inner(out, x)
        out[item] = head(x)
    
    if tail:
        return chain_assign(tail, assign)
    else:
        return assign

In practice this defines an iterator over functions, but an immutable one. I guess this is what makes it work, avoiding the type-checking and lowering problems described above.
What if we could define in a more convenient way such an iterator or maybe a function-tuple?
Each instance of the iterator/function-tuple would be its own type (similar to how at the moment each function is its own type). Each instance must be created with a reference to actual functions, such that they can be signature-checked against each other.
The example above could be written as:

def ex(func_tuple, choice_index, args0, args1):
    x = func_tuple[choice_index](*args0)     # pos C
    y = func_tuple[choice_index](*args1)     # pos D
    return x, y

Here's where my lack of experience with the internals of Numba stops me. I cannot reproduce the step by step analysis sklam's did. I hope the idea is not too stupid, though.

ehsantn · 2019-04-02T03:07:54Z

I think function types need to carry the signature to have consistent typing and straight-forward implementation (similar to C). In this way, things like containers of functions as in #2542 will be easy as well. Following @sklam's example (assuming sig0 as typing template of f):

At A, h is typed as FunctionType(sig0).
At B, h is unified to FunctionType(sig0), assuming g has the same typing template (is template unification possible?).
At C, h(*args0) uses sig0 as a typing-template.
At D, h(*args1) uses sig0 as a typing-template.

At lowering, the signature is known from function variable type, and the function pointer is taken from function variable value and called. Also, supporting more advanced cases like inlining specific cases can be done using literals, value-based specialization, etc. Does this make sense?

sklam · 2019-04-02T15:16:20Z

@luk-f-a, there is still a difficulty in typing the code:

def ex(func_tuple, choice_index, args0, args1):
    x = func_tuple[choice_index](*args0)     # pos C
    y = func_tuple[choice_index](*args1)     # pos D
    return x, y

It is difficulty to determine the function type from func_tuple[choice_index]. Currently, Numba will only work if choice_index is a literal so that it can determine the type at compile time. If choice_index is a runtime variable, we still need to unify the type of func_tuple[0] and func_tuple[1]. Thus, it will be the equivalent problem as my example in #3405 (comment).

@ehsantn's suggestion will require explicit annotation from the user. For instance,

sig = Signature(...)
f_sig = f.compile(sig)
g_sig = g.compile(sig)
exc(f_sig, g_sig, ...)

In fact, we already have the .compile method. It just doesn't carry the call-signature in it. It sounds doable.

luk-f-a · 2019-04-04T07:10:18Z

@sklam, thanks, I guess I was confused by the following example:

@nb.njit
def f(a):
    return a

tup = (f, f)

@nb.njit
def ex(tup, ind,a , b):
    x = tup[ind](a)
    y = tup[ind](b)
    return x,y

ex(tup, 0, 1,2)

This works well, despite in theory not being able to determine the function type from func_tuple[choice_index]. I assumed that the only problem was whether incoming tuples are classified as UniTuples and therefore getting a getitem method. So I guess we need some guarantee that the signature is the same and that can only be done after compile?

Would compile be explicitly necessary? or would it be enough to do

@njit(sig_str)
def f

@njit(sig_str)
def g

exc(f, g,...)

luk-f-a · 2019-04-08T07:41:24Z

@sklam, going back to @seibert 's question, what do you think is the level of effort for someone familiar with Numba internals? thanks!

sklam · 2019-04-09T18:27:48Z

Explicit compile is probably not necessary and @njit(sig_str) should be the same.

As for effort, I do not expect the initial changes would be big but there may be a lot of subtle points to make it useful. It'd worth to do a quick experimental patch to see how involved it would be.

pearu · 2019-12-20T12:34:23Z

Heads up: with PR #4967, all the examples in this issue work provided the input functions are cfunc-decorated.

songololo · 2020-01-09T20:29:42Z

Looking forward to seeing this develop.

I am finding the new Dict and List types quite useful: Out of curiosity, once #4967 is implemented, would it be possible to pass a numba function type to a List or Dict?

pearu · 2020-01-09T20:45:51Z

I think having List[<FunctionType>], for instance, would be a natural use case of first-class functions. In fact, I don't see reasons atm why it would not work out of the box.

NicolasHug mentioned this issue Oct 12, 2018

[MRG] Histogram computation optimization ogrisel/pygbm#14

Merged

4 tasks

stuartarchibald added the bug label Oct 15, 2018

stuartarchibald added feature_request and removed bug labels Oct 15, 2018

stuartarchibald changed the title ~~Type unification error when using function pointer from inside a jitclass~~ Feature Request: Function as first-class type Oct 15, 2018

sklam self-assigned this Apr 9, 2019

luk-f-a mentioned this issue May 24, 2019

Exponentially increasing compilation time #4117

Closed

sklam removed their assignment Jul 23, 2019

sklam mentioned this issue Nov 18, 2019

Can not use vectorized, nopython functions (passed as arguments) inside njit'd functions #4869

Open

pearu mentioned this issue Dec 14, 2019

A prototype of first-class functions [DON'T MERGE] #4967

Closed

10 tasks

pearu mentioned this issue Feb 20, 2020

First-class function type #5287

Merged

2 tasks

sklam closed this as completed in #5287 Mar 27, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Function as first-class type #3405

Feature Request: Function as first-class type #3405

NicolasHug commented Oct 12, 2018 •

edited

stuartarchibald commented Oct 15, 2018

ogrisel commented Oct 15, 2018

NicolasHug commented Oct 15, 2018

stuartarchibald commented Oct 15, 2018

luk-f-a commented Dec 30, 2018

seibert commented Jan 2, 2019

luk-f-a commented Jan 2, 2019

seibert commented Mar 31, 2019

sklam commented Apr 1, 2019

luk-f-a commented Apr 1, 2019

ehsantn commented Apr 2, 2019

sklam commented Apr 2, 2019

luk-f-a commented Apr 4, 2019

luk-f-a commented Apr 8, 2019

sklam commented Apr 9, 2019

pearu commented Dec 20, 2019

songololo commented Jan 9, 2020

pearu commented Jan 9, 2020

Feature Request: Function as first-class type #3405

Feature Request: Function as first-class type #3405

Comments

NicolasHug commented Oct 12, 2018 • edited

stuartarchibald commented Oct 15, 2018

ogrisel commented Oct 15, 2018

NicolasHug commented Oct 15, 2018

stuartarchibald commented Oct 15, 2018

luk-f-a commented Dec 30, 2018

seibert commented Jan 2, 2019

luk-f-a commented Jan 2, 2019

seibert commented Mar 31, 2019

sklam commented Apr 1, 2019

luk-f-a commented Apr 1, 2019

ehsantn commented Apr 2, 2019

sklam commented Apr 2, 2019

luk-f-a commented Apr 4, 2019

luk-f-a commented Apr 8, 2019

sklam commented Apr 9, 2019

pearu commented Dec 20, 2019

songololo commented Jan 9, 2020

pearu commented Jan 9, 2020

NicolasHug commented Oct 12, 2018 •

edited