[Unity] Implement FNormalize attribute for operators #16067

Lunderberg · 2023-11-03T18:10:54Z

Some Relax operators have requirements regarding their AST that are stronger than are checked by the C++ types being used. These are similar to checks that are present in the tvm::relax::WellFormed utility, such as checks forbidding the use of undefined variables, which are also stronger than required by the underlying C++ types. However, because every operator may have unique requirements, it would be unreasonable to expect a writer of a relax::ExprMutator to be aware of and to maintain all such requirements.

This PR introduces an operation operator attribute FNormalize. If defined, this function is used to apply an operator-specific normalization. The implementation of FNormalize has the following design decisions.

If no change is required, FNormalize should return the input argument unmodified.
FNormalize is only responsible for normalization of the operator itself. The expression it returns may be unnormalized (e.g. contain nested expressions).
FNormalize receives the BlockBuilder as an argument, to allow context-dependent normalization.

For example, an operator whose normalization requires in-line expressions may use BlockBuilder::LookupBinding to perform variable replacement.
FNormalize is applied after FInferStructInfo. FNormalize may assume that the relax::Call passed to FNormalize has well-defined struct info.
- Corollary: FInferStructInfo may not assume that its relax::Call argument has been passed through FNormalize.
  
  This is a reasonable requirement, because (1) shape inference should depend only on the struct info of arguments and not the values themselves, and (2) this only impacts operators that use FNormalize.
FNormalize should not be used to apply simplifications, and should be limited to cases where the same computation may be expressed in multiple manners.

For example, replacing a by-variable tuple with an in-line tuple in R.call_tir is a form of normalization, but replacing R.add(arg, R.const(0)) with arg is a form of simplification.

This separation is to ensure that FNormalize has minimal overhead, as some simplifications may have large computational costs, and FNormalize is applied as part of all ExprMutator usage. A later PR will introduce an attribute FSimplify, along with a dedicated pass to apply simplifications.
Use of FNormalize is suppressed while parsing TVMScript. TVMScript must be able to generate test cases that trigger specific failure modes, and that may include producing un-normalized relax IR. In addition, TVMScript must be stable when passed through a round-trip from IR to text to IR.
If an IRModule contains any non-normalized operators, the IRModule is ill-formed. That is, all FNormalize operations on a well-formed module are no-ops.

Some Relax operators have requirements regarding their AST that are stronger than are checked by the C++ types being used. These are similar to checks that are present in the `tvm::relax::WellFormed` utility, such as checks forbidding the use of undefined variables, which are also stronger than required by the underlying C++ types. However, because every operator may have unique requirements, it would be unreasonable to expect a writer of a `relax::ExprMutator` to be aware of and to maintain all such requirements. This PR introduces an operation operator attribute `FNormalize`. If defined, this function is used to apply an operator-specific normalization. * If no change is required, `FNormalize` should return the input argument unmodified. * `FNormalize` is only responsible for normalization of the operator itself. The expression it returns may be unnormalized (e.g. contain nested expressions). * `FNormalize` receives the `BlockBuilder` as an argument, to allow context-dependent normalization. For example, an operator whose normalization requires in-line expressions may use `BlockBuilder::LookupBinding` to perform variable replacement. * `FNormalize` is applied after `FInferStructInfo`. `FNormalize` may assume that the `relax::Call` passed to `FNormalize` has well-defined struct info. * Corollary: `FInferStructInfo` may not assume that its `relax::Call` argument has been passed through `FNormalize`. This is a reasonable requirement, because (1) shape inference should depend only on the struct info of arguments and not the values themselves, and (2) this only impacts operators that use `FNormalize`. * `FNormalize` should not be used to apply simplifications, and should be limited to cases where the same computation may be expressed in multiple manners. For example, replacing a by-variable tuple with an in-line tuple in `R.call_tir` is a form of normalization, but replacing `R.add(arg, R.const(0))` with `arg` is a form of simplification. This separation is to ensure that `FNormalize` has minimal overhead, as some simplifications may have large computational costs, and `FNormalize` is applied as part of all `ExprMutator` usage. A later PR will introduce an attribute `FSimplify`, along with a dedicated pass to apply simplifications. * Use of `FNormalize` is suppressed while parsing TVMScript. TVMScript must be able to generate test cases that trigger specific failure modes, and that may include producing un-normalized relax IR. In addition, TVMScript must be stable when passed through a round-trip from IR to text to IR.

src/relax/analysis/well_formed.cc

tqchen · 2023-11-06T14:34:16Z

Thanks for the proposed change. I like how FNormalize can help reducing overhead of creating certain operators and bring them back to normal form.

I only have one comment on the wellform check side. In this case, it is useful to have an intentionally duplicated check that is different from FNormalize , e.g. have a TEnforceExplicitTupleInArgs attribute that enforces the tuple argument being unpacked, and check this condition. This provides extra layer of protection, makes the intention clear and is also more efficient

Lunderberg · 2023-11-06T15:37:14Z

Thank you, and I like the overall design. I think we still want to keep all the normalization logic in FNormalize, without adding boolean flags for specific cases. The more boolean flags we have, the more difficult it is for a developer to know the rules for all flags. For example, a developer would need to check if the TEnforceExplicitTupleInArgs is used as part of the well-formed check, whether it triggers an assert during normalization, whether it triggers a normalization step during normalization, whether the normalization step applies when parsing TVMScript, etc. By implementing each of these features on top of the same FNormalize functionality, new operator-specific normalization rules can be implemented without adding to a developer's mental overhead. A developer only needs to know that the new normalization is handled the same as all existing integrations.

(Also, see the other comment for performance benchmarking.)

tqchen · 2023-11-06T16:26:00Z

after thinking a bit more, i now agree that we can reuse FNormalize in wellform check. thanks for proposing the change

tqchen · 2023-11-06T16:26:24Z

src/relax/ir/block_builder.cc

+    // How much opt could an opt op Op if an opt op could op opt?
+    if (auto opt_op = op->op.as<Op>()) {
+      auto op = opt_op.value();
+      if (apply_f_normalize_ && op_map_normalize_.count(op)) {


We can use this function https://github.com/apache/tvm/blob/main/include/tvm/ir/op.h#L476

Thank you, and updated! I had checked for a single-parameter .get, and an iterator-style .find, but hadn't found the two-parameter .get.

Updated to use if (auto func_normalize = op_map_normalize_.get(call->op, nullptr); func_normalize != nullptr), here and in well_formed.cc.

tqchen · 2023-11-06T16:26:44Z

src/relax/analysis/well_formed.cc

+    // case it produced a nested expression.
+
+    if (auto opt_op = call->op.as<Op>()) {
+      auto op = opt_op.value();


https://github.com/apache/tvm/blob/main/include/tvm/ir/op.h#L476 we can directly use this function to simplofy the logic

op_map_normalize_.get(call->op, nullptr)

Thank you, and updated to use if (auto func_normalize = op_map_normalize_.get(call->op, nullptr); func_normalize != nullptr).

Lunderberg · 2023-11-06T17:22:01Z

Thank you, and changes made as suggested!

This was referenced Nov 3, 2023

[Unity] Implement FNormalize for relax.op.call_tir #16068

Merged

[Unity][Transform] Handle relax.Var as call_tir args when lowering #15916

Closed

Lunderberg force-pushed the unity_operator_specific_normalization branch from 82211e5 to 6b6a185 Compare November 3, 2023 19:27

Lunderberg force-pushed the unity_operator_specific_normalization branch from 6b6a185 to f4ec8a3 Compare November 3, 2023 20:22

tqchen reviewed Nov 6, 2023

View reviewed changes

src/relax/analysis/well_formed.cc Show resolved Hide resolved

Disable C++ lint on explicit zero-parameter constructor

ef6777a

tqchen reviewed Nov 6, 2023

View reviewed changes

Avoid double-lookup with map.count(op) then map[op]

f4ab501

Lunderberg mentioned this pull request Nov 6, 2023

[Draft][Unity] Allow dynamic indices to TupleGetItem #16002

Closed

tqchen approved these changes Nov 7, 2023

View reviewed changes

tqchen merged commit e506bff into apache:unity Nov 7, 2023
15 checks passed

Lunderberg deleted the unity_operator_specific_normalization branch November 7, 2023 14:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Unity] Implement FNormalize attribute for operators #16067

[Unity] Implement FNormalize attribute for operators #16067

Lunderberg commented Nov 3, 2023 •

edited

tqchen commented Nov 6, 2023

Lunderberg commented Nov 6, 2023

tqchen commented Nov 6, 2023

tqchen Nov 6, 2023

Lunderberg Nov 6, 2023

tqchen Nov 6, 2023

tqchen Nov 6, 2023

Lunderberg Nov 6, 2023

Lunderberg commented Nov 6, 2023

[Unity] Implement FNormalize attribute for operators #16067

[Unity] Implement FNormalize attribute for operators #16067

Conversation

Lunderberg commented Nov 3, 2023 • edited

tqchen commented Nov 6, 2023

Lunderberg commented Nov 6, 2023

tqchen commented Nov 6, 2023

tqchen Nov 6, 2023

Choose a reason for hiding this comment

Lunderberg Nov 6, 2023

Choose a reason for hiding this comment

tqchen Nov 6, 2023

Choose a reason for hiding this comment

tqchen Nov 6, 2023

Choose a reason for hiding this comment

Lunderberg Nov 6, 2023

Choose a reason for hiding this comment

Lunderberg commented Nov 6, 2023

Lunderberg commented Nov 3, 2023 •

edited