
Quantized _out functions don't follow same conventions as other out functions in the codebase #36508

Open
ezyang opened this issue Apr 13, 2020 · 6 comments
Labels
better-engineering: Relatively self-contained tasks for better engineering contributors
low priority: We're unlikely to get around to doing this in the near future
oncall: quantization: Quantization support in PyTorch
triaged: This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

ezyang (Contributor) commented Apr 13, 2020

Currently they're defined like:

.op("quantized::add_out(Tensor qa, Tensor qb, Tensor(a!) out)"
     "-> Tensor(a!) out",
    c10::RegisterOperators::options()
      .aliasAnalysis(at::AliasAnalysisKind::FROM_SCHEMA)
      .kernel<QAddOut</*ReLUFused=*/false>>(DispatchKey::QuantizedCPU))

However, a standard out function looks like this:

- func: angle(Tensor self) -> Tensor
  use_c10_dispatcher: full
  variants: function, method
  supports_named_tensor: True

- func: angle.out(Tensor self, *, Tensor(a!) out) -> Tensor(a!)
  supports_named_tensor: True
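
For illustration, if the quantized op followed that convention, the registration would presumably read roughly like the sketch below (only the schema string changes: an add.out overload name and a kwarg-only out argument, mirroring the native_functions.yaml style; this exact schema does not exist in the tree):

.op("quantized::add.out(Tensor qa, Tensor qb, *, Tensor(a!) out)"
     "-> Tensor(a!)",
    c10::RegisterOperators::options()
      .aliasAnalysis(at::AliasAnalysisKind::FROM_SCHEMA)
      .kernel<QAddOut</*ReLUFused=*/false>>(DispatchKey::QuantizedCPU))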

cc @jerryzh168 @jianyuh @raghuramank100 @jamesr66a @vkuzo @jgong5 @Xia-Weiwen @leslie-fang-intel @dzhulgakov @kevinbchen (who added the alias analysis annotations to these functions) and @z-a-f (who appears to have originally added these out variants at #23971 )

@mrshenli added the better-engineering, oncall: quantization, and triaged labels on Apr 14, 2020
supriyar (Contributor) commented:

cc @vkuzo

z-a-f (Contributor) commented Apr 15, 2020

The quantized ops live under torch.ops.quantized.XXX and are not expected to be used by the end user in Python. However, native_functions.yaml does expose its functions in Python (mostly). Do we really need to have identical APIs for the two? Also, I am not sure whether the dispatcher schema supports *.

dzhulgakov (Collaborator) commented:

Kwarg-only arguments should work with the custom op registration API too. Switching to that is probably not BC-compatible in practice, but if the JIT serializes explicit kwargs properly it might be fine.
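
As a concrete sketch (using the TORCH_LIBRARY macros rather than c10::RegisterOperators, with a hypothetical example namespace and kernel, so none of this matches code in the tree), a kwarg-only out argument can be written directly in the registration schema string:

#include <torch/library.h>

// Hypothetical out-variant kernel; a real one would requantize into out.
at::Tensor& example_add_out(const at::Tensor& qa, const at::Tensor& qb, at::Tensor& out) {
  return out;
}

TORCH_LIBRARY(quantized_example, m) {
  // The * marks everything after it as kwarg-only, as in native_functions.yaml out variants.
  m.def("add.out(Tensor qa, Tensor qb, *, Tensor(a!) out) -> Tensor(a!)");
}

TORCH_LIBRARY_IMPL(quantized_example, QuantizedCPU, m) {
  m.impl("add.out", TORCH_FN(example_add_out));
}

Callers would then have to pass out explicitly as a keyword argument, which is where the BC concern about serialized JIT calls comes in.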

ezyang (Contributor, Author) commented Apr 17, 2020

> Do we really need to have identical APIs for the two?

That is a great question! This isn't really related to this issue, but I don't know why quantization also registers its operators under the quantized namespace. Does anyone else know? (@jerryzh168?)

@andrewor14 added the low priority label on Nov 17, 2023
andrewor14 (Contributor) commented:

@jerryzh168 Any context on this? Is it too late to change it now?

jerryzh168 (Contributor) commented:

Yeah, I feel this is no longer relevant, since these ops are for the server/edge CPU use case and were used in the original eager mode quantization flow. We have since moved on in both backend (focusing more on server GPU + edge) and flow (focusing more on the pt2e flow and the current GPU + Triton flow).
