add to ExternalBackendFunction data model #56834

bdhirsh · 2021-04-23T22:52:29Z

A few tweaks to the codegen that make the next PR (generating out/inplace wrappers) easier.

[main change] When generating ExternalBackendFunction objects, I added the backend's kernel to the dispatch entry of NativeFunction. The invariant now is that if ExternalBackendFunction implements the XLA backend, then it should expect its corresponding NativeFunction to have an XLA entry in the dispatch table. I do that with a new with_dispatch_key() method on NativeFunction. This makes it easier to re-use the code in register_dispatch_key.py, since the external logic now is mostly just "another dispatch key" (with some exceptions; see the next PR). I'm open to other opinions though.
Doing that requires parsing the backend into a valid dispatch key, so I added more validations and tests. We also technically need to parse the backend's autograd key if they've provided any autograd kernels (if xla provides "backend: XLA" and they have an autograd section, I try to parse the "AutogradXLA" key), so I added tests for that too.
I ended up removing ExternalBackendFunctionsGroup.from_function_group and putting the logic directly in gen_backend_stubs.py. Kind of annoying, but I mostly did it because storing the kernel name directly in ExternalBackendFunction (in the dispatch table) requires calling the dispatcher API, and we can't add a dependency on dispatcher.py directly in model.py. We don't have that problem with NativeFunction objects because the kernel name is provided directly in the yaml, whereas for external ops, we force them to follow the dispatcher convention for naming.

Stack from ghstack:

[do not merge] temporary fix #56962 [do not merge] temporary fix
generate in-place/out wrappers for external kernels #56835 [WIP] generate in-place/out wrappers for external kernels
add to ExternalBackendFunction data model #56834 add to ExternalBackendFunction data model
remove bridge API from codegen #55796 remove bridge API from codegen
[external codegen] better yaml error messaging, added explicit error message tests #56597 [external codegen] better yaml error messaging, added explicit error message tests
add _to_cpu() operator #55795 add _to_cpu() operator

[ghstack-poisoned]

facebook-github-bot · 2021-04-23T22:52:35Z

💊 CI failures summary and remediations

As of commit 245034d (more details on the Dr. CI page):

1/1 failures introduced in this PR

1 failure not recognized by patterns:

Job	Step	Action
^quick-checks	^{Ensure no unqualified type ignore}	🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

[ghstack-poisoned]

A few tweaks to the codegen that make the next PR (generating out/inplace wrappers) easier. 1. [**main change**] When generating `ExternalBackendFunction` objects, I added the backend's kernel to the `dispatch` entry of `NativeFunction`. The invariant now is that if `ExternalBackendFunction` implements the `XLA` backend, then it should expect its corresponding `NativeFunction` to have an `XLA` entry in the dispatch table. I do that with a new `with_dispatch_key()` method on `NativeFunction`. This makes it easier to re-use the code in `register_dispatch_key.py`, since the external logic now is mostly just "another dispatch key" (with some exceptions; see the next PR). I'm open to other opinions though. 2. Doing that requires parsing the backend into a valid dispatch key, so I added more validations and tests. We also technically need to parse the backend's autograd key if they've provided any autograd kernels (if xla provides "backend: XLA" and they have an autograd section, I try to parse the "AutogradXLA" key), so I added tests for that too. 3. I ended up removing `ExternalBackendFunctionsGroup.from_function_group` and putting the logic directly in `gen_backend_stubs.py`. Kind of annoying, but I mostly did it because storing the kernel name directly in `ExternalBackendFunction` (in the dispatch table) requires calling the dispatcher API, and we can't add a dependency on `dispatcher.py` directly in `model.py`. We don't have that problem with NativeFunction objects because the kernel name is provided directly in the yaml, whereas for external ops, we force them to follow the dispatcher convention for naming. [ghstack-poisoned]

ezyang · 2021-04-27T01:34:11Z

tools/codegen/model.py

        return str(self).lower()

+    def is_autograd_key(self) -> bool:
+        return 'Autograd' in str(self)


perhaps, if str(self).startswith('Autograd') as a more restrictive option

ezyang · 2021-04-27T01:38:13Z

tools/codegen/model.py

+        try:
+            return DispatchKey.parse(value)
+        except AssertionError:
+            return None


Will be wondering why we need this. If we actually need to detect parse errors we'll need to roll out a separate exception hierarchy for them; catching assert errors is too dangerous

Fair; I can rewrite this to avoid the try/catch. Alternatively, if we're fine with the original parsing error message then we don't need this

ezyang · 2021-04-27T01:40:27Z

tools/codegen/model.py

        )

+    @staticmethod
+    def with_dispatch_entry(f: 'NativeFunction', dispatch_key: DispatchKey, kernel: str) -> 'NativeFunction':


It's a little odd this isn't a method, given that it takes a NativeFunction as argument

ezyang · 2021-04-27T01:42:42Z

tools/codegen/gen_backend_stubs.py


    backend = yaml_values.pop('backend', None)
    assert backend is not None, 'You must provide a value for "backend"'
+    backend_key = DispatchKey.try_parse(backend)


Can't you just rely on the parser failure propagating here?

I totally can- I added this to make the error message clearer. That way if e.g., a backend without a corresponding autograd key tried to add an autograd entry, they'd get a more actionable error message than a parsing error.

Do you think the parsing error is good enough? We probably want good documentation on the new codegen anyway, that makes it clear how to opt into autograd kernels.

Another way to make the error better is to "push an error context" before doing the parse. The preexisting example:

with context(f'in native_functions.yaml line {f.loc}:\n {f.func}'):

ezyang · 2021-04-27T01:52:02Z

tools/codegen/gen_backend_stubs.py

+            dispatch_key = DispatchKey.parse(f'Autograd{backend}') \
+                if m is not None and m.is_autograd else DispatchKey.parse(backend)
+            kernel = kernel_name(f.func)
+            return ExternalBackendFunction(NativeFunction.with_dispatch_entry(f, dispatch_key, kernel), dispatch_key, m)


I'm thinking back to https://github.com/pytorch/pytorch/pull/55050/files#r616215579

The stated motivation for patching in the dispatch information is to make it easier to reuse existing code in the codegen. This is very reasonable. But to me this more and more bends the orientation of the system towards trying to reuse the existing data structures or directly adding in the information as you need, and then augmenting it with side data (so the initial model doesn't have to changed).

After talking to Ed, I'm working on a new patch that will change most of the contents of this PR:

Rather than augmenting NativeFunction directly to be external-backend-aware, I'm going to move data on NativeFunction that is backend-dependent into a different struct. Important stuff it'll contain includes (a) the kernel name (different per backend), and (b) whether the kernel is structured (technically the same for all in-tree backends, but can be different per external backend).

ezyang · 2021-04-27T02:29:14Z

tools/codegen/gen_backend_stubs.py

+            kernel = kernel_name(g.out.func)
+            dispatch_key = DispatchKey.parse(f'Autograd{backend}') \
+                if out_meta is not None and out_meta.is_autograd else DispatchKey.parse(backend)
+            out = ExternalBackendFunction(NativeFunction.with_dispatch_entry(g.out, dispatch_key, kernel), dispatch_key, out_meta)


The three blocks here feel juuuust long enough to want some deduping

ezyang

okey dokey

github-actions · 2022-04-13T04:42:18Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

add to ExternalBackendFunction data model

9076062

[ghstack-poisoned]

facebook-github-bot added the cla signed label Apr 23, 2021

This was referenced Apr 23, 2021

add _to_cpu() operator #55795

Closed

[external codegen] better yaml error messaging, added explicit error message tests #56597

Closed

remove bridge API from codegen #55796

Closed

generate in-place/out wrappers for external kernels #56835

Closed

Update on "add to ExternalBackendFunction data model"

ffc1fe9

[ghstack-poisoned]

bdhirsh requested review from bhosmer and ezyang April 26, 2021 17:41

bdhirsh mentioned this pull request Apr 26, 2021

[do not merge] temporary fix #56962

Closed

ezyang reviewed Apr 27, 2021

View reviewed changes

ezyang approved these changes Apr 27, 2021

View reviewed changes

bdhirsh mentioned this pull request May 4, 2021

[codegen] split out backend-specific information from NativeFunction in the model #57361

Closed

github-actions bot added the Stale label Apr 13, 2022

github-actions bot closed this May 13, 2022

facebook-github-bot deleted the gh/bdhirsh/108/head branch June 12, 2022 14:18

add to ExternalBackendFunction data model #56834

add to ExternalBackendFunction data model #56834

Uh oh!

Conversation

bdhirsh commented Apr 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented Apr 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CI failures summary and remediations

1 failure not recognized by patterns:

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ezyang left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Apr 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bdhirsh commented Apr 23, 2021 •

edited

Loading

facebook-github-bot commented Apr 23, 2021 •

edited

Loading