Bad overload order for zeros_like #19685

suo · 2019-04-24T18:33:22Z

In the pytorch_torch_functions.cpp generated code, the zeros_like binding looks like

static PyObject * THPVariable_zeros_like(PyObject* self_, PyObject* args, PyObject* kwargs)
{
  HANDLE_TH_ERRORS
  static PythonArgParser parser({
    "zeros_like(Tensor input, *, ScalarType dtype=None, Layout layout=torch.strided, Device device=None, bool pin_memory=False, bool requires_grad=False)",
    "zeros_like(Tensor input, *, bool requires_grad=False)",
  }, /*traceable=*/true);
...

In this case, the second overload will never get hit as all the arguments in the first overload have defaults. This is causing a bug where tracing will "constant-ify" device instead of correctly deriving the device at runtime from the input tensor (see #19637).

I tracked the overload sorting code to here but I don't understand it. My desired behavior is to swap the order of the overloads in cases like the above.

Can anyone help me out? cc. @gchanan, @cpuhrsch, @ezyang

The text was updated successfully, but these errors were encountered:

colesbury · 2019-04-24T20:44:10Z

Why are there two overloads in the first place? We should strive for only one overload as this makes error messages much more consistent with other Python functions.

suo · 2019-04-24T20:52:18Z

That's true. In reality, all the TensorOptions arguments need to be optionals, and we should merge the TensorOptions passed with the input tensor's option as the fallback value.

Right now the logic to do that is in the binding code (pytorch_torch_functions), which means that it is not reusable from the JIT, so we would need to move that to the actual operator implementation.

Who is the right person to work on this @gchanan? I can help but I don't know how it fits into the ongoing schema unification thing

gchanan · 2019-04-30T16:07:25Z

removed high priority because I think the JIT issue is real, but the specific overload issue doesn't seem serious, as pointed out by @colesbury.

Who is the right person to work on this @gchanan? I can help but I don't know how it fits into the ongoing schema unification thing

The "frontend" schema unification part is complete in that all function signatures in native match their corresponding signature in JIT. The backends are still super complicated, because they do things like splatting / reverse splatting TensorOptions for no good reason.

I believe @VitalyFedyunin has been looking a bit into TensorOptions sanity, but I don't know exactly what he has planned in the short term.

VitalyFedyunin · 2019-04-30T16:36:28Z

I'm on hold with the TensorOptions while @smessmer working on the codegen unification, As it might introduce more complexity into the code base.

bhosmer · 2019-12-12T01:41:30Z

@izdeby let's check back on this after #30983 lands. cc @gchanan

fmassa added high priority module: pybind Related to our Python bindings / interactions with other Python libraries topic: operator triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module labels Apr 26, 2019

gchanan removed the high priority label Apr 30, 2019

izdeby mentioned this issue Dec 12, 2019

Codegen refactoring master task #30405

Open

15 tasks

mruberry added module: tensor creation and removed topic: operator labels Oct 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bad overload order for zeros_like #19685

Bad overload order for zeros_like #19685

suo commented Apr 24, 2019 •

edited

colesbury commented Apr 24, 2019

suo commented Apr 24, 2019

gchanan commented Apr 30, 2019

VitalyFedyunin commented Apr 30, 2019

bhosmer commented Dec 12, 2019

Bad overload order for zeros_like #19685

Bad overload order for zeros_like #19685

Comments

suo commented Apr 24, 2019 • edited

colesbury commented Apr 24, 2019

suo commented Apr 24, 2019

gchanan commented Apr 30, 2019

VitalyFedyunin commented Apr 30, 2019

bhosmer commented Dec 12, 2019

suo commented Apr 24, 2019 •

edited