[Relax][Training] Add automatic differentiation pass #103

Merged

Conversation

@Ubospica Ubospica (Contributor) commented on Jan 18, 2023

This is the follow-up PR to #55 after the source branch moved to a personal repo.

This PR is based on #98.

This PR adds the new automatic differentiation API:

  • Gradient(func: GlobalVar, require_grads: Optional[Union[Var, List[Var]]] = None) -> tvm.ir.transform.Pass
    • transforms the given function in the IRModule and adds a new function that computes the gradients of the function's output with respect to the parameters specified by require_grads (all parameters when it is not given)

Currently, Gradient only supports differentiating a function in the IRModule that contains a single dataflow block, with respect to the function's sole return value, which must be a scalar. A minimal usage sketch is shown below.
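
For concreteness, here is a minimal usage sketch written against the Relax Python API; the exact operator spellings (`R.add`, `R.sum`) and the name of the generated gradient function are illustrative assumptions, not details confirmed by this PR:

```python
import tvm
from tvm import relax
from tvm.script import relax as R


@tvm.script.ir_module
class Module:
    @R.function
    def main(x: R.Tensor((3, 3), "float32"), y: R.Tensor((3, 3), "float32")):
        # A single dataflow block with a scalar return value,
        # as currently required by Gradient.
        with R.dataflow():
            lv = R.add(x, y)
            gv = R.sum(lv)
            R.output(gv)
        return gv


# Apply the pass; require_grads=None differentiates with respect to all parameters.
mod = relax.transform.Gradient(Module.get_global_var("main"))(Module)
# The transformed module keeps the original `main` and adds a gradient
# function (e.g. `main_adjoint`) returning the output together with the gradients.
print(mod)
```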

This PR adds two unit-test files:

  • tests/python/relax/test_transform_gradient.py contains only assert_structural_equal assertions.
  • tests/python/relax/test_transform_gradient_numeric.py contains numeric checks, comparing against manually derived gradients and against numerical differentiation via check_numerical_grads (a sketch of this kind of check follows this list).
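
As a rough illustration of what check_numerical_grads does, here is a hedged sketch on a plain NumPy function; the real tests build and run the transformed Relax module instead, and the function and gradients below are made up for illustration:

```python
import numpy as np
import tvm.testing

x = np.random.rand(3, 3).astype("float32")
y = np.random.rand(3, 3).astype("float32")


def forward(x, y):
    # Scalar-valued function: f(x, y) = sum(x * y).
    return np.sum(x * y)


# Analytic gradients: df/dx = y, df/dy = x.
# check_numerical_grads compares them against finite-difference
# estimates of `forward` around the given inputs.
tvm.testing.check_numerical_grads(forward, {"x": x, "y": y}, {"x": y, "y": x})
```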

Checkpoints:

  • Refactor to use CopyWithNewParams and ExprFunctor
  • Check that int64/int32 tensors are not differentiated (currently only checked for parameters)
  • Rebase & migrate to StructInfo
  • Refactor Tuple handling
  • Refactor using NestedMsg
  • Support ops that take tuples as input or return tuples
  • Eliminate collapse_sum_to (done in [Op] Migration: Gradients for some operators #98)

Future:

  • (Not in this PR) Handle undefined gradients in add and in the return value
    • They are currently treated as zeros

debug finished

Change details on and modify document

polish test

unit test finished

reformat changed files

Fix problems in pr comments

move op and change them static in AD pass

fix some problems

fix for comments so far

update documents

formatted

update document1

draft 1

revise version 1

version 1 completed and formatted

revise on doc and detail

polish after refactor

remove zeros_tracker_

update doc

restructure and add float check for input

format

refactor done

refactor again

add test_tuple_ops

update comments

rebase onto struct info

fix log_softmax case

fix test

eliminator draft

fix after rebase

refactor draft

Revise doc & Epilogue & details

remove some irrelevant codes

remove collapse_sum_eliminator
normalize has problems to fix
@SiriusNEO SiriusNEO force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from f2364e4 to da9f2d7 on January 19, 2023 09:25
@SiriusNEO SiriusNEO force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from 2eafa15 to ac98e81 on January 19, 2023 14:00
@SiriusNEO SiriusNEO force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from bb035e4 to cd34f4d on January 19, 2023 15:48
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from f6d5622 to 79a9e94 on January 19, 2023 19:12
@Ubospica Ubospica changed the title from "[WIP][Relax][AD] Add automatic differentiation pass" to "[Relax][AD] Add automatic differentiation pass" on Jan 19, 2023
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from 79a9e94 to ad2495c on January 19, 2023 19:18
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from ad2495c to 7eb39f8 on January 19, 2023 19:29
modify nested_msg.h, util.h
add testcases
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from 9421fff to 9011549 on January 20, 2023 09:09
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from 9011549 to d2854c6 on January 20, 2023 09:11
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from 3b56962 to 0cdd5dc on January 22, 2023 09:25
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch 2 times, most recently from c6c10dc to e5f4504 on January 23, 2023 07:37
@Ubospica Ubospica force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from e5f4504 to 0389aaf on January 23, 2023 07:45
@SiriusNEO SiriusNEO force-pushed the mlc-dev/2023-01-18-ad_after_tuple_refactor branch from 04572e8 to 3c0d8fc on January 23, 2023 09:58
@SiriusNEO SiriusNEO (Contributor) left a comment:

Looks good to me now

@MasterJH5574 MasterJH5574 merged commit 5f31e8e into mlc-ai:relax Jan 23, 2023
@MasterJH5574 MasterJH5574 (Member) commented:

Thanks for the great effort in pushing this PR! Would you folks mind sending the nested msg changes, together with the test in this PR, to tlc-pack when you get time? That would ease our next sync. But no hurry at all - enjoy your holidays :-)

MasterJH5574 pushed a commit that referenced this pull request Jan 24, 2023
This PR is a small fix patch for #103, containing two small modifications:
- The AD PR introduced `NestedMsgToExpr`, where the signature of `fmapleaf` is `Expr fmapleaf(T)`, and a null nested msg directly throws an error. This patch changes the signature to `Expr fmapleaf(Optional<T>)` and passes `NullOpt` to `fmapleaf`, letting the user decide whether to throw an error or return a default value.
- The nested msg test forgot to change the signature of `fmapleaf` (originally `Expr fmapleaf(NestedMsg<T>)`). It still passed due to the implicit conversion between `NestedMsg<T>` and `T`, but it needed to be fixed.
MasterJH5574 pushed a commit that referenced this pull request Jan 28, 2023
MasterJH5574 pushed a commit that referenced this pull request Jan 28, 2023
MasterJH5574 pushed a commit that referenced this pull request Jan 31, 2023
MasterJH5574 pushed a commit that referenced this pull request Feb 8, 2023
MasterJH5574 pushed a commit that referenced this pull request Feb 12, 2023