RFC-0010: TensorRef #16

ezyang · 2021-03-03T20:18:48Z

This proposal introduces a new class TensorRef, to replace all
places in our codebase where we currently use const Tensor&. The
distinguishing characteristics of this class are:

* It is non-owning.
* It is as safe as other by-value reference types (like c10::ArrayRef or
  std::string_view).
* It is implicitly convertible (with some exceptions) to const Tensor&
  (i.e., it can be introduced incrementally).

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

smessmer · 2021-03-03T21:12:10Z

RFC-0010-tensor-ref.md

+  we unsafely use reclaim/release to ensure that no refcount bump occurs
+  on construction/destruction of the object.  We verified with Godbolt
+  that the compiler is able to eliminate the `base_` destructor.  This
+  makes it possible to take out a `const Tensor&`, which makes it easier


As long as the Tensor is purely internal to TensorRef, this could be safe, since we have control over the invariants and where refcounting occurs. But allowing users to get a const Tensor& out of it means that the "internal" Tensor object could be copied outside of your control, and you have to be much more careful about correctness, especially since Tensor doesn't seem to know whether it's borrowed or not. I saw that one of your earlier proposals had Tensor know about this; maybe we should do that?

Or, if the const Tensor& conversion is only there to avoid having to rewrite all ops at once, maybe there's a regex/codemod that would allow us to rewrite them more easily and then not allow TensorRef -> const Tensor& conversions? Or, if we manage to eliminate most but not all call sites, we might be able to make it a TensorRef -> Tensor (by value) conversion and be ok with the refcount if somebody does that. Seems like a less dangerous design.

ezyang · 2021-03-03

> But allowing users to get a const Tensor& out of it means that the "internal" Tensor object could be copied outside of your control and you have to be much more careful about correctness

A copy of const Tensor& to Tensor is OK, because this always induces a refcount bump! Most of the bad situations @swolchok was able to come up with required you to have a mutable reference/pointer to the internal Tensor, which we just forbid here.

> Or, if the const Tensor& conversion is only there to avoid having to rewrite all ops at once, maybe there's a regex/codemod that would allow us to rewrite them more easily and then not allow TensorRef -> const Tensor& conversions?

Yeah. I allude to this in the second alternative proposal, where for Itanium ABI reasons, it is much better if the long-term API doesn't permit conversion to const Tensor& (so we can make the class trivial). So ideally we'd get rid of this BC crutch eventually.
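
The refcount argument in this exchange can be illustrated with a toy model. The sketch below is not PyTorch code: `ToyImpl`, `ToyTensor`, `ToyTensorRef`, `BorrowTag`, and `run_scenario` are hypothetical stand-ins, and the borrow/release pair only imitates the reclaim/release trick described in the RFC excerpt. Under those assumptions, it shows that wrapping a tensor in a non-owning reference leaves the count unchanged, while copying a tensor out through the `const` reference conversion bumps it.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for TensorImpl: only an intrusive reference count.
struct ToyImpl {
  std::size_t refcount = 0;
};

struct BorrowTag {};  // Selects the constructor that adopts without a bump.

// Hypothetical stand-in for Tensor: an owning handle.
class ToyTensor {
 public:
  explicit ToyTensor(ToyImpl* impl) : impl_(impl) { ++impl_->refcount; }
  // Adopt the pointer without touching the count (imitates "reclaim").
  ToyTensor(ToyImpl* impl, BorrowTag) : impl_(impl) {}
  // Copying an owning handle always bumps the count.
  ToyTensor(const ToyTensor& other) : impl_(other.impl_) {
    if (impl_) ++impl_->refcount;
  }
  ToyTensor& operator=(const ToyTensor&) = delete;
  ~ToyTensor() {
    if (impl_) --impl_->refcount;
  }
  // Give up the pointer without touching the count (imitates "release").
  ToyImpl* release() {
    ToyImpl* p = impl_;
    impl_ = nullptr;
    return p;
  }
  ToyImpl* get() const { return impl_; }

 private:
  ToyImpl* impl_;
};

// Non-owning reference that still exposes a const ToyTensor&.
class ToyTensorRef {
 public:
  ToyTensorRef(const ToyTensor& t) : base_(t.get(), BorrowTag{}) {}
  ToyTensorRef(const ToyTensorRef& o) : base_(o.base_.get(), BorrowTag{}) {}
  ToyTensorRef& operator=(const ToyTensorRef&) = delete;
  // Null out base_ first so its destructor does not decrement the count.
  ~ToyTensorRef() { base_.release(); }
  // Only const access is handed out; no mutable reference to base_ escapes.
  operator const ToyTensor&() const { return base_; }

 private:
  ToyTensor base_;
};

struct Observed {
  std::size_t owned, borrowed, copied_out, after;
};

// Records the refcount at each step of a borrow-then-copy scenario.
Observed run_scenario() {
  ToyImpl impl;
  Observed r{};
  ToyTensor owner(&impl);
  r.owned = impl.refcount;
  {
    ToyTensorRef ref(owner);       // borrow: no bump
    r.borrowed = impl.refcount;
    const ToyTensor& view = ref;   // implicit conversion: no bump
    ToyTensor copy(view);          // copy out of the const reference: bump
    r.copied_out = impl.refcount;
  }
  r.after = impl.refcount;
  return r;
}
```

In this model, `run_scenario()` observes a count of 1 after the owner is created, still 1 while the reference is alive, 2 once a copy is taken through the `const` reference, and 1 again after the inner scope ends.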

smessmer · 2021-03-03T21:20:56Z

RFC-0010-tensor-ref.md

+* Instead of trying to force the above implementation of `TensorRef` to
+  be used everywhere, it could instead be a utility class used in
+  limited situations to improve interoperability with code that expects
+  a `const Tensor&` when you don't have a `Tensor` available.  The true,


I don't follow why there are two objects - a trivial and a nontrivial one. Can you give some more details?

ezyang · 2021-03-03

The nontrivial TensorRef class is given in this proposal. The trivial class is:

class TrivialTensorRef { TensorImpl* impl_; };

smessmer · 2021-03-03

Oh, I see. I like the trivial one much better from a design point of view ;) But I guess you agree on that point, and the non-trivial one only exists for the BC reasons discussed above.
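
To make the trivial versus nontrivial distinction concrete, here is a minimal sketch using standard type traits. `ToyImpl`, `TrivialRef`, `NontrivialRef`, and `same_target` are hypothetical names, with `ToyImpl` standing in for `TensorImpl`. Under the Itanium C++ ABI, a class with a non-trivial copy constructor, move constructor, or destructor is passed by invisible reference, whereas a trivial one-pointer class can typically be passed in a register on common platforms. The traits checked below are a close proxy for that rule, not the ABI's exact wording.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical stand-in for TensorImpl.
struct ToyImpl {
  int refcount = 0;
};

// One-pointer class with no user-provided copy, move, or destructor.
class TrivialRef {
 public:
  explicit TrivialRef(ToyImpl* impl) : impl_(impl) {}
  ToyImpl* get() const { return impl_; }

 private:
  ToyImpl* impl_;
};

// Same layout, but a user-provided destructor makes it non-trivial.
class NontrivialRef {
 public:
  explicit NontrivialRef(ToyImpl* impl) : impl_(impl) {}
  ~NontrivialRef() { impl_ = nullptr; }

 private:
  ToyImpl* impl_;
};

static_assert(std::is_trivially_copyable<TrivialRef>::value,
              "TrivialRef should be trivially copyable");
static_assert(std::is_trivially_destructible<TrivialRef>::value,
              "TrivialRef should be trivially destructible");
static_assert(sizeof(TrivialRef) == sizeof(ToyImpl*),
              "TrivialRef should be the size of one pointer");
static_assert(!std::is_trivially_destructible<NontrivialRef>::value,
              "NontrivialRef has a user-provided destructor");
static_assert(!std::is_trivially_copyable<NontrivialRef>::value,
              "a non-trivial destructor rules out trivial copyability");

// Takes both references by value; for TrivialRef that copies one pointer each.
bool same_target(TrivialRef a, TrivialRef b) { return a.get() == b.get(); }
```

The design trade-off discussed in the thread shows up directly: a wrapper that needs a destructor to suppress a refcount decrement cannot also satisfy the trivial-type checks.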

ezyang · 2021-04-10T02:35:14Z

A variation of this has been implemented in pytorch/pytorch#55685

swolchok · 2021-05-26T22:34:41Z

> A variation of this has been implemented in pytorch/pytorch#55685

As with the TensorRef in the proposal, MaybeOwned is hobbled by the Itanium ABI requirement to pass it by reference always, so it doesn't solve the argument passing problem either.

TensorRef rfc

6c6cc40

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

facebook-github-bot added the cla signed label Mar 3, 2021

Add another alternative

dad05fc

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

smessmer reviewed Mar 3, 2021

public version fo doc

fe70962

Signed-off-by: Edward Z. Yang <ezyang@fb.com>
