[QNN] Concat - Refactoring to C++ #3819

anijain2305 · 2019-08-22T16:03:59Z

There are two reasons for moving to C++:

1. With the earlier Python interface, we had to call infer_type in the op's Python definition. This broke the clean abstraction that we have between Relay and QNN ops.
2. While working with different targets, I realized that we need to write QNN passes that operate at the abstraction level of QNN ops. For example, Intel needs the conv to be uint8 x int8, whereas TFLite graphs can be uint8 x uint8. This can be resolved by writing a legalize pass for the QNN conv2d op on Intel machines that inserts a requantize on the weight matrix to go from uint8 to int8. With the earlier Python interface, the ops were already lowered to Relay ops before I could even run this pass.
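To make the uint8 to int8 weight conversion concrete, here is a minimal standalone sketch of the arithmetic only. It assumes the scale is left unchanged, in which case the requantize reduces to shifting both the stored values and the zero point by 128. `QuantizedInt8` and `ShiftUint8ToInt8` are hypothetical names for illustration, not TVM APIs, and this is not the legalize pass itself.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical container for illustration only (not a TVM type).
struct QuantizedInt8 {
  std::vector<int8_t> data;
  int32_t zero_point;
};

// A quantized value q represents the real number scale * (q - zero_point).
// Subtracting 128 from both q and zero_point leaves that real number
// unchanged, so a uint8 tensor can be re-expressed as int8 at the same scale.
inline QuantizedInt8 ShiftUint8ToInt8(const std::vector<uint8_t>& data,
                                      int32_t zero_point) {
  assert(zero_point >= 0 && zero_point <= 255);
  QuantizedInt8 out;
  out.zero_point = zero_point - 128;
  out.data.reserve(data.size());
  for (uint8_t q : data) {
    // The result is in [-128, 127], so the narrowing cast is lossless.
    out.data.push_back(static_cast<int8_t>(static_cast<int32_t>(q) - 128));
  }
  return out;
}
```

A pass working at the QNN level could apply this kind of transformation to the weight before the conv is lowered, which is the point of keeping the QNN ops intact until legalization has run.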

Please review @yzhliu @jackwish @FrozenGene @vinx13 @tqchen @zhiics

anijain2305 · 2019-08-30T17:23:44Z

@vinx13 @zhiics @yzhliu Can you please review this?

Refactoring to C++ removes InferType dependency. Relevant comment - #3730 (comment)

vinx13 · 2019-08-30T18:39:35Z

src/relay/pass/pattern_util.h

@@ -415,6 +415,13 @@ static inline Expr Full(Expr fill_value,
  return CallNode::make(op, {fill_value}, Attrs(attrs), {});
 }

+static inline Expr Concatenate(Expr data, int axis) {


This is actually the same as MakeConcatenate below, isn't it?

Yup. Nice observation. Made the changes to use that.

src/relay/qnn/util.h

zhiics · 2019-08-30T19:52:23Z

src/relay/qnn/op/concatenate.cc

+                        Array<tvm::Expr> input_zero_points, double output_scale,
+                        int32_t output_zero_point, int axis) {
+  auto attrs = make_node<QnnConcatenateAttrs>();
+  attrs->input_scales = input_scales;


Suggested change

- attrs->input_scales = input_scales;
+ attrs->input_scales = std::move(input_scales);

zhiics · 2019-08-30T19:52:42Z

src/relay/qnn/op/concatenate.cc

+                        int32_t output_zero_point, int axis) {
+  auto attrs = make_node<QnnConcatenateAttrs>();
+  attrs->input_scales = input_scales;
+  attrs->input_zero_points = input_zero_points;


Suggested change

- attrs->input_zero_points = input_zero_points;
+ attrs->input_zero_points = std::move(input_zero_points);
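The effect of the two `std::move` suggestions can be seen in a small standalone sketch. `CountingArray` and `Attrs` are hypothetical stand-ins rather than TVM types, written only to count copies. For a reference-counted handle the saving would be a reference-count update rather than a deep copy, but the pattern is the same: a parameter taken by value is already a private copy, so it can be moved into its destination.

```cpp
#include <utility>
#include <vector>

// Hypothetical stand-in for an array-like attribute; it counts copies.
struct CountingArray {
  std::vector<double> values;
  static int copies;

  CountingArray() = default;
  explicit CountingArray(std::vector<double> v) : values(std::move(v)) {}
  CountingArray(const CountingArray& other) : values(other.values) { ++copies; }
  CountingArray(CountingArray&&) noexcept = default;
  CountingArray& operator=(const CountingArray& other) {
    values = other.values;
    ++copies;
    return *this;
  }
  CountingArray& operator=(CountingArray&&) noexcept = default;
};
int CountingArray::copies = 0;

// Hypothetical attribute node holding one array field.
struct Attrs {
  CountingArray input_scales;
};

// Original form: the by-value parameter is copied into the attribute.
inline Attrs MakeByCopy(CountingArray input_scales) {
  Attrs attrs;
  attrs.input_scales = input_scales;  // copy assignment
  return attrs;
}

// Suggested form: the by-value parameter is moved, so no extra copy is made.
inline Attrs MakeByMove(CountingArray input_scales) {
  Attrs attrs;
  attrs.input_scales = std::move(input_scales);  // move assignment
  return attrs;
}
```

After the move, the parameter is left in a valid but unspecified state, which is harmless here because it is not used again.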

zhiics · 2019-08-30T19:55:04Z

src/relay/qnn/op/concatenate.cc

+
+/*
+ * \brief Canonicalizes the QNN concatenate op.
+ * \param ref_call The original call that will be lowered.


The parameter names in the comment do not match the function's arguments.
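For readers following the review, here is a rough standalone sketch of what canonicalizing a quantized concatenate can look like. It assumes that the lowering requantizes every input whose scale or zero point differs from the output's and then performs a plain concatenate. All names are hypothetical, tensors are flattened to 1-D so the axis is ignored, and the floating-point rounding is a simplification rather than the arithmetic TVM uses.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical 1-D quantized tensor for illustration only.
struct QuantizedTensor {
  std::vector<uint8_t> data;
  double scale;
  int32_t zero_point;
};

// Re-express one value under new quantization parameters, saturating to uint8.
inline uint8_t RequantizeValue(uint8_t q, double in_scale, int32_t in_zp,
                               double out_scale, int32_t out_zp) {
  const double real = in_scale * (static_cast<int32_t>(q) - in_zp);
  const long rounded = std::lround(real / out_scale) + out_zp;
  return static_cast<uint8_t>(std::min(255L, std::max(0L, rounded)));
}

// Requantize inputs whose parameters differ from the output's, then append.
inline QuantizedTensor ConcatSketch(const std::vector<QuantizedTensor>& inputs,
                                    double out_scale, int32_t out_zp) {
  assert(out_scale > 0.0);
  QuantizedTensor out{{}, out_scale, out_zp};
  for (const QuantizedTensor& in : inputs) {
    const bool same_params = in.scale == out_scale && in.zero_point == out_zp;
    for (uint8_t q : in.data) {
      out.data.push_back(same_params ? q
                                     : RequantizeValue(q, in.scale, in.zero_point,
                                                       out_scale, out_zp));
    }
  }
  return out;
}
```

Inputs that already share the output parameters pass through untouched, so the requantize cost is paid only where the parameters actually differ.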

zhiics · 2019-08-30T19:59:38Z

src/relay/qnn/util.h

+                     const Array<IndexExpr>& input_shape, const DataType& out_dtype);
+
+static inline Expr Requantize(const Expr& data, const Array<IndexExpr>& input_shape,
+                              const double& input_scale, const int32_t& input_zero_point,


There is no need to use const xx& for built-in types; just pass them by value, e.g. double input_scale, int32_t input_zero_point, etc.
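As a small illustration of this guideline, the sketch below passes scalars by value. `Dequantize` is a hypothetical helper, not the `Requantize` function under review. Built-in arithmetic types are cheap to copy, so a const reference adds an indirection without saving anything.

```cpp
#include <cstdint>
#include <type_traits>

// Built-in arithmetic types are trivially copyable and fit in a register,
// so taking them by value is the idiomatic choice.
static_assert(std::is_trivially_copyable<double>::value,
              "double is cheap to copy");
static_assert(std::is_trivially_copyable<int32_t>::value,
              "int32_t is cheap to copy");

// Hypothetical scalar helper: maps a quantized value back to a real number.
inline double Dequantize(int32_t quantized, double scale, int32_t zero_point) {
  return scale * static_cast<double>(quantized - zero_point);
}
```

Larger arguments such as the data expression and the shape array are still worth passing by const reference, as the reviewed signature already does.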

anijain2305 · 2019-08-30T20:34:13Z

Thanks @zhiics and @vinx13 for the comments. The changes are in.

zhiics

lgtm

anijain2305 force-pushed the qnn_concat branch from 41c82e4 to 6b35322 Compare August 23, 2019 06:30

zhiics mentioned this pull request Aug 23, 2019

[Relay] Refactor - Move infer types to a header file. #3783

Merged

anijain2305 force-pushed the qnn_concat branch 6 times, most recently from ea98355 to 5679038 Compare August 30, 2019 17:21

vinx13 requested changes Aug 30, 2019

anijain2305 force-pushed the qnn_concat branch from 5679038 to b1e43d2 Compare August 30, 2019 19:34

zhiics requested changes Aug 30, 2019

[QNN] Concat - Refactoring to C++

cc55ce8

anijain2305 force-pushed the qnn_concat branch from b1e43d2 to cc55ce8 Compare August 30, 2019 20:24

vinx13 approved these changes Aug 31, 2019

zhiics approved these changes Aug 31, 2019

zhiics merged commit ec7790e into apache:master Aug 31, 2019

zhiics added the status: accepted label Aug 31, 2019

zhenhuaw-me mentioned this pull request Sep 2, 2019

[QNN] Add operator #3736

Merged

wweic pushed a commit to wweic/tvm that referenced this pull request Sep 16, 2019

[QNN] Concat - Refactoring to C++ (apache#3819)

f286483

wweic pushed a commit to wweic/tvm that referenced this pull request Sep 16, 2019

[QNN] Concat - Refactoring to C++ (apache#3819)

54b1ae3

wweic pushed a commit to neo-ai/tvm that referenced this pull request Sep 16, 2019

[QNN] Concat - Refactoring to C++ (apache#3819)

11d62df

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed

anijain2305 deleted the qnn_concat branch November 13, 2019 00:30
