[Unity][Op] Gradient functions for high-level Relax operators #14527
Merged
tqchen merged 5 commits into apache:unity on Apr 8, 2023
Conversation
tvm-bot (Collaborator):
Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.
Member:
@SiriusNEO one minor note, make sure you append the co-author commit message to the end.
yongwww reviewed on Apr 7, 2023
Co-authored-by: Yixin Dong <ubospica@gmail.com>
Force-pushed from 2bfe776 to 7ecef07.
tqchen approved these changes on Apr 8, 2023
Intro
This PR registers gradient functions for many high-level Relax operators. As in Relay, the gradient function is registered as an attribute FPrimalGradient (an OpAttr) on the corresponding Relax operator, but the function signature differs from Relay's:

- orig_call is the original call expr that we want to differentiate.
- output_grad is the gradient of the RHS (the output).
- orig_var is y, the variable the original call is bound to. It is passed in to save some calculations.
- ctx is the context, which is not used right now, but we believe it will be useful for dynamic-shape cases and when we need to emit bindings or perform normalizations.

For some complicated gradient functions, we introduce high-level backward operators and put them under the namespace op.grad.xxx. All gradient functions are well tested (numerically). For more details, please check Part 2 of this document.
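For illustration, here is a minimal sketch of how a gradient function with this signature could be registered. It is not the exact upstream code: tvm.ir.register_op_attr is used here as the generic registration mechanism, and the multiply rule is just an example (the actual registrations in the Relax codebase may use a dedicated helper, and this particular one already exists upstream).

```python
# Minimal sketch (not the exact upstream code) of registering a gradient
# function under the "FPrimalGradient" operator attribute.
from tvm import relax
from tvm.ir import register_op_attr

def multiply_grad(orig_var, orig_call, output_grad, ctx):
    """Gradient of z = x * y: dz/dx = output_grad * y, dz/dy = output_grad * x."""
    x, y = orig_call.args
    return [
        relax.op.multiply(output_grad, y),  # gradient w.r.t. x
        relax.op.multiply(output_grad, x),  # gradient w.r.t. y
    ]

# Attach the gradient function to the relax.multiply operator.
# (In practice this registration already exists; shown only as a sketch.)
register_op_attr("relax.multiply", "FPrimalGradient", multiply_grad)
```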
Others
This PR also fixes two small problems about ops:

- CumsumAttrs isn't declared on the Python side (see the declaration sketch at the end of this description).
- A small fix related to variance.

Co-authored-by: Yixin Dong <ubospica@gmail.com>
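As a closing note on the CumsumAttrs fix above: attrs nodes are mirrored on the Python side via tvm._ffi.register_object. Below is a hedged sketch of what such a declaration looks like, assuming the C++ type key is "relax.attrs.CumsumAttrs" (following the pattern used for the other Relax attrs classes).

```python
# Sketch of the Python-side declaration pattern for an attrs node.
# The type key "relax.attrs.CumsumAttrs" is assumed to match the C++ side.
import tvm._ffi
from tvm.ir import Attrs

@tvm._ffi.register_object("relax.attrs.CumsumAttrs")
class CumsumAttrs(Attrs):
    """Attributes used in the cumsum operator."""
```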